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(54) Title: ADENOVIRUS PROTEIN IX, ITS DOMAINS INVOLVED IN CAPSID ASSEMBLY, TRANSCRIPTIONAL ACTIV- 
ITY AND NUCLEAR REORGANIZATION 

(57) Abstract: Described are adenovirus pIX proteins which are modified by mutating one or more amino acids and/or by the 
inclusion of a binding moiety. Preferably, said modification is carried out in the N-lerminal part or in the C-lerminal leucine-repeat 
of the pIX protein. It is described that viruses or virus-like particles containing such a modified pIX protein show an improved 
gene delivery efficiency. Furthermore, described are corresponding adenoviral vectors, viruses or virus-like particles, host cells, 
complementation cell lines and methods for producing such viruses or virus-like particles. In addition, described are pharmaceutical 
compositions comprising an adenoviral vector, virus or virus-like particle, host cell or complementation cell line as mentioned above 
and therapeutical applications thereof. 
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5 Adenovirus protein IX , its domains involved in capsid assembly, transcriptional 

activity and nuclear reorganization 

The invention relates to new recombinant and mutant adenovirus pIX polypeptides 
and nucleic acid sequences encoding them, to new vectors comprising said nucleic acid 

10 sequence, and to virus or virus-like particles, in particular adenoviral or pseudo adenoviral 
particles comprising said pIX polypeptide or said nucleic acid sequence. Furthermore, the 
invention relates to a method for preparing an adenoviral or pseudo adenoviral particle; to 
cells and compositions comprising said polypeptide, nucleic acid sequence, virus, particle or 
pseudo particle; to compositions comprising said cells as well as to therapeutic or 

15 prophylactic uses. The invention is of very special interest with regard to prospects for gene 
therapy or vaccination using gene transfer, in particular in human. 

Gene therapy, and more widely gene vaccination, can be defined as the transfer of 
genetic material into a cell or an organism to treat or prevent a genetic or acquired disease, or 
to circumvent molecular, cellular or organ disorders. The possibility of treating human 

20 diseases or disorders by gene therapy has changed in a few years from the stage of theoretical 
considerations to that of clinical applications. The first protocol applied to man was initiated 
in the USA in September 1990. It concerned a patient who was genetically immunodeficient 
as a result of a mutation affecting the gene encoding adenine deaminase (ADA). This first 
encouraging experiment has been followed by numerous new applications and promising 

25 clinical trials based on gene therapy which are currently ongoing (see for example clinical 

trials listed at http://cnetdb.nci.nih.gov/trialsrch.shtml or 

http://www.wiley.co.uk/genetherapy/clinical/). 

Successful gene therapy principally depends on the efficient delivery of a therapeutic 
gene of interest to make its expression possible in cells of a living organism. Therapeutic 
30 genes can be transferred into cells using a wide variety of vectors resulting in either transient 
expression or permanent transformation of the host genome. During the past decade, a large 
number of viral, as well as non-viral, vectors has been developed for gene transfer (see for 
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example Robbins et al., 1993, Tibtech 16, 35-40 and Rolland, 1998, Therapeutic Drug Carrier 
Systems 15, 143-198 for reviews). 

Most of the intracellular gene delivery mechanisms used to date relate to the first 
category, i.e. viral vectors, especially adeno- and retroviral vectors. Viruses have developed 
5 diverse and highly sophisticated mechanisms to achieve transport across the cellular 
membrane, escape from lysosomal degradation, delivery of their genome to the nucleus and, 
consequently, have been used in many gene delivery applications. Their structure, 
organization and biology are described in the literature available to a person skilled in the art. 

One of the most widely used vectors for in vivo gene transfer is a replication-deficient 
10 adenoviral vector. Originally, adenoviruses have been detected in many animal species. They 
are non-integrative and not very pathogenic. They are able to infect a variety of cell types, 
dividing as well as quiescent cells. They have a natural tropism for airway epithelia. In 
addition, they have been used as live enteric vaccines for many years with an excellent safety 
profile. Finally, they can be grown easily and purified in large quantities. These features have 
15 made adenoviruses particularly appropriate for use as gene therapy vectors for therapeutic and 
vaccine purposes. A number of adenoviruses are now well characterized genetically and 
biochemically. This is, for example, the case with human adenovirus type 5 (Ad5), the 
sequence of which is disclosed in the Genbank data bank under reference M73260 (see Figure 
6). 

20 The adenoviral genome consists of a linear double-standed DNA molecule of 

approximately 36kb carrying more than about thirty genes necessary to complete the viral 
cycle. The early genes are divided into 4 regions dispersed in the adenoviral genome (El to 
E4) which altogether contain 6 transcription units directed by their own promoters. The El, 
E2 and E4 regions are essential for viral replication whereas the E3 region, which is believed 

25 to modulate the anti-viral host immune response, is dispensable for viral growth in vitro. The 
late genes (LI to L5) encode in their majority the structural proteins constituting the viral 
capsid. They overlap at least in part with the early transcription units and are transcribed from 
a unique promoter (MLP, for Major Late Promoter). In addition, the adenoviral genome 
carries at both extremities cis-acting regions essential for DNA replication. These are the 5* 

30 and 3' TTR (Inverted Terminal Repeat) and a packaging sequence following 5' ITR. 

The El early region is located at the 5' end of the adenoviral genome, and contains 2 
viral transcription units, El A and E1B, respectively. This region codes for proteins which 
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participate very early in the viral cycle and are essential to the expression of almost all the 
other genes of the adenovirus. In particular, the E1A transcription unit codes for a protein 
which trans activates the transcription of the other viral genes, in particular by inducing 
transcription from the promoters of the E1B, E2A, E2B and E4 regions. 

5 The products of the E2 region, which also comprises two transcription units, E2A and 

E2B, are directly involved in the replication of the viral DNA. This region governs, in 
particular, the synthesis of a 72 kDa protein, which displays a strong affinity for single- 
stranded DNA, and of a DNA polymerase. 

The E3 region is not essential to the replication of the virus. It codes for at least six 
10 proteins which appear to be responsible for the inhibition of the host's immune response to an 
adenovirus infection. In particular, the gpl9kDa glycoprotein appears to prevent the CTL 
response which is responsible for the cytolysis of infected cells by the host's cytotoxic T cells. 

The E4 region is believed to be involved in viral DNA replication, late mRNA 
synthesis, viral assembly and the shut off of host protein synthesis. It is a complex 

15 transcription unit which encodes a variety of polypeptides. Those encoded by the open 
reading frames (ORFs) 6 and 7 are assumed to compete with the cellular RB protein for 
binding to the E2F transcription factor, thereby conferring a transactivating function . The 
expression product of ORF4 is able to bind and regulate the cellular phosphatase 2A to 
modulate the activity of viral (E1A) and cellular transcription factors. The polypeptides 

20 encoded by ORFs 3 and 6 are essential to viral growth because of their capability to maturate 
the primary 28 kb transcript derived from the adenoviral genome and to promote its export 
into the cytoplasm. Their absence may be complemented in trans to allow viral growth. In 
addition, the ORF6 polypeptide interacts with the E1B encoded 55K polypeptide to fomi a 
complex that facilitates the cytoplasmic accumulation of late messengers at the expense of 

25 cellular mRNA. 

The adenoviral vectors presently used in gene therapy protocols lack most of the El 
region in order to avoid their dissemination in the environment and the host body. Additional 
deletions in the E3 region allow to increase the cloning capacity. The gene of interest is 
introduced into the viral DNA in place of a deleted region. The feasibility of gene transfer 
30 using these vectors designated "first generation" has been demonstrated in a number of cases. 
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Further constructs ("second generation vectors 95 ) have been made that retain the cis 
regions necessary for viral replication (ITRs and packaging sequences) and contain substantial 
genetic modifications with the aim to abolish residual synthesis of viral antigens. The antigens 
have been postulated to be responsible for the stimulation of inflammatory responses (see for 
5 example the international application W094/28152 or US 5,670,488 which discloses 
adenoviral vectors having E4 sequences partially deleted with the exception of ORF3 or 
ORF6/7 that do not need E4 complementation). A minimal vector deficient in all adenoviral 
functions can also be considered. For a general revue, one can refer to Lusky et al. J. Virol 72 
(1998) 2022-2032). 

10 Finally, the infectious cycle of the adenovirus takes place in 2 steps: 

the early phase which precedes initiation of replication of the adenoviral 
genome, and which permits production of the regulatory proteins 
participating in the replication and transcription of the viral DNA, and 

once replication of the viral DNA has been initiated, transcription of the 
15 late genes begins. 

The late genes occupy the majority of the adenoviral genome and partially overlap the 
transcription units of the early genes. However, they are transcribed from different promoters 
and according to an alternative mode of splicing, so that the same sequences are used for 
different purposes. Most of the late genes are transcribed from the major late promoter 

20 (MLP). This promoter permits the synthesis of a long primary transcript, which is then 
matured into about twenty messenger RNAs (mRNAs) from which the capsid proteins of the 
virion are produced. The gene coding for structural protein IX (pIX) of which the capsid is 
composed is located at the 5' end of the adenoviral genome and overlaps the E1B region at its 
3 1 end. The protein IX transcription unit utilizes the same transcription termination signal as 

25 the E1B transcription unit. 

pIX is a small polypeptide of 140 amino acid residues (14.3 kDa) that is incorporated 
into the viral particle or pseudo particle, or capsid. More specifically, said pIX polypeptide is 
associated with hexon proteins to form group-of-nine hexons (GON) that make up the central 
region of each facet of the icosahedron (Boulanger et al., 1979, J. General Virology, 44, 783- 
30 800 ; Burnett, 1985, J. Molecular Biology, 185, 125-143 ; Burnett et al., 1985, J. Molecular 
Biology, 185, 105-123). Precise determination of the stochiometry of this assembly has 
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revealed that there are 12 molecules of pIX, organized as four trimers per GON, and therefore 
240 molecules per capsid (Steward et al., 1991, Cell, 67, 145-154). The pDC polypeptide acts 
as a capsidic cement and thereby enhances thennal stability of the viral particles (Colby et al, 
1981, J. Virol., 39, 977-980 ; Furcinitti et al., 1989, EMBO J., 8, 3563-3570). However, while 
it is well established that pIX is essential for building the adenoviral capsid, no result is yet 
available permitting to precisely delineate the "minimal" pEX sequence which is actually 
necessary for obtaining correct encapsidation of the adenoviral genome, or at least for the 
formation of particles (whether or not containing the adenoviral genome ). 

Additionally, observations have strongly suggested that pIX is furthermore involved in 
the infectious cycle (Lutz et al., 1997, J. of Virology, 71, 5102-5109) : (i) the gene encoding 
pIX is the only gene encoding a structural protein which is not expressed under the control of 
the adenoviral major late promoter (MLP) , (ii) its expression pattern follows a time course 
different from that of the other structural proteins and begins at intermediate times post- 
infection (p.i.), and actually much earlier than that of all the other structural proteins; (iii) 
finally, pIX accumulates in the nuclei of infected cells with a speckled distribution. In 
agreement with this nuclear localization, Lutz et al. (1997, Journal of Virology 71, 5102-9) 
have previously shown that pIX is a transcriptional activator of several viral and cellular 
TATA-containing promoters, among which are the genes controlled by the El a, E4 and 
MUP promoters. 

The design of viral gene therapy vectors which are capable to deliver therapeutic 
genes to a specific cell represents one of the main interest and challenge in today's gene 
therapy research. The use of targeting vectors would limit the vector spread, thus increasing 
therapeutic efficacy for the desired target cells and minimizing potential side effects. 

The broad tissue tropism of adenoviruses may turn disadvantageous when genes 
encoding potentially harmful proteins (e.g. cytokines, cytotoxic proteins, suicide gene 
products) are expressed in surrounding normal tissues. Moreover, the overall in vivo 
efficiency of gene delivery might be reduced by a significant dilution of the virus in the 
organism due to the transduction of non-target cells. The development of adenovirus vectors 
with targeted infectivity capacities would therefore greatly improve the safety and efficacy of 
some current gene therapy strategies. Thus, targeting adenoviral vectors may improve gene 
therapy procedures by providing augmented infectivity of poorly transduced cells (e.g. tumor 
cells) and decreased toxicity to normal tissues. 
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The specificity of infection of the adenoviruses is determined by the attachment of the 
virions to cellular receptors present at the surface of permissive cells. In this regard, the fiber 
present at the surface of the viral capsid plays a critical role in cellular attachment (Defer et al. 
J. Virol. 64 (1990) 3661-3673) and penton-base promotes internalization through the binding 
5 to the cellular integrins (Mathias et al. J. Virol. 68 (1994) 6811-6814). Recent studies have 
presumed the use of the coxsackie virus receptor (CAR) by type 2 and 5 adenoviruses 
(Bergelson et al; Science 275 (1997) 1320-1323). In particular, the initial attachment of the 
adenovirus particle to the cell surface is mediated by the binding of the knob region of the 
viral fiber protein to the ubiquitous CAR (Bergelson et al, 1997, Science 275, 1320-1323; 

10 Tomko et al, 1997, Proc. Natl. Acad. Sci. USA 94, 3352-3356) .CAR was identified as the 
primary receptor for adenovirus serotype C fibers (e.g. Ad5) and the cell-surface heparan 
sulfate glycosaminoglycans (HSG) which was recently shown to interact with Ad5 fiber and 
facilitate virus binding to cells (Dechecchi et al, 2000, Virology 268, 382-390 ; Dechecchi et 
al, 2001, J. Virol. 75, 8772-8780). However, other surface proteins may also be involved in 

15 fiber attachment, for example, the alpha2 domain of the class I histocompatibility antigens as 
identified by Hong et al. (EMBO J. 16 (1997) 2294-2306). The fiber is composed of 3 regions 
(Chroboczek et al. Current Top. Microbiol. Immunol. 199 (1995) 165-200) : the tail at the N- 
terminus of the protein which interacts with penton base and ensures the anchorage in the 
capsid, the shaft composed of a number of beta-sheet repeats and the knob which contains the 

20 trimerization signals (Hong et al. J. Virol. 70 (1996) 7071-7078) and the receptor binding 
moiety (Henri et al. J. Virol. 68 (1994) 5239-5246 ; Louis et al. J. Virol. 68 (1994) 4104- 
4106). 

The almost ubiquitous distribution of the CAR cellular receptor is thought to be 
primarily responsible for the broad cell tropism of the human serotype C adenoviruses. 

25 Consistent with this notion, the absence or reduced expression of this receptor has been 
shown to correlate with the poor sensitivity of certain cell types (e.g. lymphocytes, smooth 
muscle cells) to adenovirus transduction (Leon et al, 1998, , Proc. Natl. Acad. Sci. USA 95, 
13159-13164 ; March et al, 1995, Hum. Gene Ther. 6, 41-63). Moreover, numerous studies 
have now reported that primary tumor cells express only low levels of CAR (Li et al, 1999, 

30 Cancer Res. 59, 325-330 ; Miller et al, 1998, Cancer Res, 58, 5738-5748). 



)OClD: <WO_02096939A2_I_> 



WO 02/096939 



7 



PCT/EP02/05942 



The inability of adenovirus to efficiently transduce specific cell populations, together 
with its lack of strict tissue specificity, have thus stimulated increasing efforts to redirect the 
adenovirus tropism from its natural receptors to specific cell surface molecules. 

Recently, specific fiber mutations which eliminate the interaction with CAR were 
5 identified, demonstrating that the CAR binding site of the fiber knob domain can be mutated 
without adversely affecting the quaternary structure and overall conformation of the purified 
recombinant protein (W098/44121, WO01/16344 and WO01/38361). For example, fiber 
proteins carrying amino acid substitutions in the AB loop (involving Ser408 and Pro409), in 
the DG loop (e.g. involving Tyr 477) and in beta-strand F (e.g. involving Leu 485) or having 
10 two consecutive amino acids deleted in the DG loop were shown to alter CAR binding 
(Bewley et aL, 1999, Science 286, 1579-1583 ; Kirby et al., 1999, J. Virol 73, 9508-9514 ; 
Kirby et al., 2000, J. Virol. 74, 2804-2813). 

These viruses are structurally identical to native viruses and therefore constitute appropriate 
substrates for the insertion of targeting ligands (binding moieties) in the mutated fibers. In this 

15 respect, several groups have reported that insertion of stretches of lysine residues at the C- 
terminal end of the knob could lead to the generation of high titer viruses that were 
characterized by a 10 to 300 fold increase in their efficiency of infection of CAR-deficient 
cells, such as macrophages, endothelial cells, smooth muscle cells or T lymphocytes 
(Wickham et al, 1997, J. Virol 71, 8221-8229 ; Yoshida et al, 1998, Hum. Gene Ther. 9, 

20 2503-2515 ; Wickham et al, 1996, Nature Biotechnology 14, 1570-1573 ; Bouri et al, 1999, 
Hum. Gene Ther. 10, 1633-1640). Furthermore, the introduction of a peptide ligand binding 
the transferrin receptor in the fiber HI loop was shown to facilitate gene transfer to cells 
which over-express this receptor (Xia et al, 2000, J. Virol. 74, 11359-11366). Similarly, a 
HUVEC cell-binding peptide allowed a significant increase of the transduction efficiency of 

25 the retargeted vector towards these cells which are normally refractory to transduction 
(Nicklin et al, 2000, Circulation 102, 231-237). 

Further scientific teams addressed the adenoviral targetting issue by modifying the 
adenoviral fiber protein. For example, US 5,885,808 describes an adenovirus, or adenovirus- 
like particle, having a penton fibre comprising a modified binding specificity conferred by a 
30 binding moiety which is heterologous to the adenovirus and is incorporated as a fusion protein 
with the fibre protein allowing the adenovirus or adenovirus-like particle to bind to a target 
cell which is not the natural host cell of the virus, characterized in that the said penton fibre is 
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modified by the insertion or deletion or substitution of amino acid residues that disrupt the 
host-cell binding function so that the adenovirus or adenovirus-like particle is substantially 
incapable of binding the natural host cell. 

Similarly, US 6,057,155 provides a chimeric adenovirus fiber protein, which differs 
5 from the wild-type coat protein by the introduction of a non-native amino acid sequence in a 
conformationally-restrained manner. A vector comprising such a chimeric fiber protein is able 
to direct entry into cells more efficiently than a corresponding vector that is identical except 
for comprising a wild-type adenovirus fiber protein . The non-native amino acid sequences 
may represent a peptide motif that comprises an epitope for an antibody or a ligand for a cell 
10 surface receptor, that can be employed in cell targeting. 

US 5,756,086 discloses an adenovirus, wherein the adenovirus fiber protein includes a 
ligand which is specific for a receptor located on the surface of a desired cell type. The 
adenovirus may have at least a portion of the adenovirus fiber protein removed and replaced 
with a ligand which is specific for a receptor of a desired cell type, or the adenovirus may 
15 include a fusion protein of the adenovirus fiber protein and the ligand. Such an adenovirus 
may also include a gene(s) encoding a therapeutic agent(s) and may be "targeted" in order to 
deliver such gene(s) to a desired cell type. 

However, it is to be noted that the prior art does not teach to construct targeted 
adenoviral vectors comprising a targeting ligand in the pIX protein. 

20 Nevertheless, there is still an ongoing need for further optimized vectors based on 

adenoviruses in order to improve gene therapeutic applications. 

Thus, the technical problem underlying the present invention is the provision of 
alternative means and methods for producing adenovirus-based viruses and virus-like 
particles suitable for efficient gene delivery. Preferably, viruses or virus-like particles show an 
25 improved gene delivery efficiency as compared to prior art viruses or virus-like particles, in 
particular with regard to targeting capacity and/or reduction of undesired side effects. 

The solution to this technical problem is achieved by providing the embodiments 
characterized in the claims. 

Accordingly, the present invention relates to adenovirus pIX proteins which are 
30 modified by mutation of one or more amino acids of said pIX protein as compared to the 
corresponding wild-type pIX protein and/or so as to comprise a binding moiety, wherein the 
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presence of said modified pIX protein in a virus or virus-like particle results in an improved 
gene delivery efficiency of said virus or virus-like particle in a target cell as compared to the 
gene delivery efficiency of a corresponding virus or virus-like particle containing said 
corresponding wild-type pIX protein. 

5 The invention is based on the experimental data presented in the appended Examples. 

The experiments were conducted in order to precisely delineate the functional domains of pIX 
that are responsible for the structural and transcriptional properties of the protein, in particular 
by performing an extensive mutational analysis of the pIX coding sequence . More precisely, 
it has been shown that the highly conserved N-terminal part of the protein is essential for the 
10 capsidic structural properties, whereas the C-terminal leucine-repeat (putative coiled-coil 
domain) is critical for the transactivating function and can be modified without altering the 
structural function of pIX. 

It is one of the objects in the design of adenoviral gene therapy vectors to delete 
unnecessary parts of the viral genome. In the experiments underlying the present invention it 

15 has been shown that parts of the pIX protein can be deleted or otherwise modified without 
impairing for example the viability or stability of the end-product, e.g. an adenoviral particle. 
Therefore, the present invention proposes to modify the adenoviral pIX protein by inserting a 
ligand moiety in order to modify adenovirus specificity (e.g. enhanced transduction of poorly 
infected cells or restriction of infection to specific cells or categories of cells). In order to 

20 preserve the functionality of the pIX protein during the adenovirus life cycle, specific 
locations in the pIX protein have been identified for incorporation of the targeting binding 
moiety, especially within or at the C-terminal part of the pIX protein. Moreover, the present 
invention also provides specific mutations of the pIX protein, especially in the C-terminal 
leucine-repeat domain, which relay enhance the presentation of such a binding moiety at the 

25 surface of the virus or virus-like particle. 

In the context of the present invention, the term "adenovirus pIX protein" refers to a 
pEX protein encoded by an adenoviral genome which is known to be integrated into the capsid 
of virus or virus-like particles. The present invention encompasses the full length adenoviral 
pIX which is encoded by the complete coding sequence (i.e. from the initiator ATG codon to 
30 the stop codon); However, it is possible to employ a fragment thereof generated by internal 
deletion, or truncation having the properties as described herein. For illustrative purpose, the 
pIX-encoding sequence can be isolated from an adenoviral genome by conventional 
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recombinant techniques. The pIX gene is present at the left end of the adenoviral genome 
positioned between E1B and E2 regions, e.g. from nucleotides (nt) 3609 to 4031 in the Ad5 
genome (see Figure 6). 

The modified adenoviral pIX protein of the invention may originate (i.e. the source 
5 sequence for constructing the pIX protein of the invention may be obtained) from an 
adenovirus of human or animal (e.g. canine, avian, bovine, murine, ovine, porcine, simien and 
the like) or may be a hybrid comprising fragments of diverse origins. For instance, the 
adenovirus can be of subgroup A (e.g. serotypes 12, 18, 31), subgroup B (e.g. serotypes 3, 7, 
11, 14, 16, 21, 34, 35), subgroup C (e.g. serotypes 1, 2, 5, 6), subgroup D (e.g. serotypes 8, 9, 
10 10, 13, 15, 17, 19, 20, 22-30, 32, 33, 36-39, 42-47), subgroup E (serotype 4), subgroup F 
(serotype 40, 41), or any other adenoviral serotype. Preferably, however, the modified pIX of 
the invention originates from an adenovirus of subgroup C, with a special preference for Ad2 
or Ad5 serotype. 

Although the pIX protein may vary between the different human and animal 
15 adenovirus strains, it can be identified on the basis of nucleotide and amino acid sequences 
available from different sources (e.g. databases such as GenBank and literature publications) 
or by homology with the well characterized Ad5 sequences (disclosed in Genbank under 
accession number M73260 or in Chroboczek et aL, 1992, Virology 186, 280-285). By way of 
illustration, the pIX protein of Ad5 includes 140 amino acid residues including the initiator 
20 Met residue (as shown in SEQ ID NO: 1). Moreover, Figure 1 indicates the amino acid 
sequences of pIX proteins of a number of human and animal adenovirus strains. 

The term " modified by mutation of one or more amino acids" refers to one or more 
deletions, substitutions or insertions of one or more residues as compared to the wild type pIX 
protein, or any combination of these possibilities. When several mutations are contemplated, 

25 they can concern consecutive residues and/or non consecutive residues at any location of the 
pIX sequence. Mutation may be made in a number of ways known to those skilled in the art 
using recombinant techniques, including for instance by enzymatically cleaving from the pIX- 
encoding nucleotide sequence followed by modification and ligation of the fragment obtained, 
by site-directed mutagenesis (e.g. the "QuickChange site-directed mutagenesis" system of 

30 Stratagene) or by PCR techniques. Deletion mutations can comprise from about 1 to 30 amino 
acid residues, preferably not exceeding 15 amino acids. 
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According to one embodiment, the modification results in the insertion of a binding 
moiety into the pIX sequence, within or to the N-terminal part or within or to the C-terminal 
part of the pIX protein (as defined hereinafter), with a special preference for the latter. In this 
respect, the binding moiety can be inserted at the C-terminus or within about the 30 and more 
5 preferably, about the 20 residues preceding the C-terminus. The insertion of the binding 
moiety can be made between two pIX residues or by replacing one or more pIX residues. 

According to another embodiment, the modification results in the substitution of at 
least one amino acid residue as compared to the wild type pIX protein. Preferably, such a 
mutation is located within the C-terminal leucine repeat of the pIX protein. 

10 In another specific embodiment, the modification results in the mutation of the stop 

codon of the pIX sequence, in order to suppress its stop activity (e.g. by mutating the stop 
codon in a amino acid encoding codon). In this embodiment, translation will continue beyond 
the native stop codon and the the polyA sequence naturally present downstream of the pIX 
coding sequence will be translated into a stretch of polylysine which could be use as a binding 

15 moiety connected to the C-terminus of the pIX protein. 

Within the meaning of the invention, the term "improved gene deliveiy efficiency" 
refers to the property of a virus or virus-like particle harbouring a modified pIX protein of the 
invention to infect a target cell and/or to deliver a gene of interest to a target cell (1.) more 
specifically (i.e. showing an increased ratio of infection and/or gene delivery between target 
20 and non-target cell) and/or (2.) more efficiently (i.e. infection and/or gene delivery is 
enhanced in absolute terms) as compared to a corresponding virus-like particle that does not 
harbour the modified pIX protein. 

The improved gene delivery efficiency can be easily determined by comparing, using the 
25 techniques of the art, the infection property or the propensy to deliver a given gene of interest 
(e.g. a reporter gene) of the virus or virus-like particle harboring the modified pIX as 
compared to a related virus or virus-like particle harbouring a non modified (wild-type) pIX 
protein to target cells and non target cells, either iri vitro (e.g. in cultured cells) or in vivo (e.g. 
in animal models) and under the same experimental conditions. Suitable techniques include 
30 cell infectivity studies with appropriate cell lines, evaluation of cell attachment for example 
using labeled viruses (e.g. labeled with 3 H thymidine, as described in Roelvin et al., 1996, J. 
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Virol. 70, 7614-7621). For instance when the pIX protein is modified so as to comprise a 
binding moiety, such assays involve exposition of the viruses to a target cell (e.g. displaying 
at its surface the anti-ligand molecule recognized by the binding moiety) under standard 
conditions of infection. As a result, a virus or virus-like particle harboring a modified pIX 
5 protein of the present invention shows a propensity to infect the target cell with a better 
efficiency than a virus or virus-like particle harboring a non modified (wild-type) pIX protein, 
which means that the virus or virus-like particle harboring a modified pIX protein infects 
more efficiently or more rapidely said target cells than non target cells (that do not display at 
their surface such an anti-ligand molecule), whereas a virus or virus-like particle harboring a 

10 wild-type pIX protein will infect said target cells with a lower or a similar efficiency 
compared to non-target cells. Alternatively, the improved gene delivery efficiency provided 
by the modified pIX protein of the invention can also be evaluated by measuring the level of 
gene transfer (e.g. using a reporter gene). Such a measurement can be done using any 
techniques in the art including Western blotting, ELISA, immunodetection, enzymatic 

15 detection, biological activities and the like. 

Advantageously, the modified pIX protein of the present invention improves gene 
delivery efficiency when the infection or delivery of a gene of interest to a target cell 
measured with a virus or virus-like particle harbouring such a modified pIX is substantially 
increased by at least a factor of two as compared to that observed with a virus or virus-like 

20 particle harbouring a wild-type adenoviral pIX. Preferably, it is increase by at least about five, 
more preferably by at least about one order of magnitude, even more preferably by at least 
about two orders of magnitude as compared to that observed with the corresponding wild-type 
virus or virus-like particle. 

A "target cell" as used herein is a cell where infection of a virus or virus-like particle 

25 harboring the modified pIX protein of the invention is expected. « Target cell » refers to a 
single entity, or can be part of a larger collection of cells. Such a larger collection of cells can 
comprise, for instance, a cell culture (either mixed or pure), a tissue (e.g., epithelial or other 
tissue), an organ (e.g., heart, lung, liver, urinary bladder, muscle or other organ), an organ 
system (e.g., circulatory system, respiratory system, gastrointestinal system, urinary system, 

30 nervous system, integumentary system or other organ system), or an organism (e.g., a 
mammal, particularly a human, or the like). In the context of the present invention, the target 
cell is preferably a tumoral cell. When the pIX protein is modified so as to comprise a binding 
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moiety, the "target cell" may designate a unique type of cell or a group of different types of 
cells having as a common feature on their surface anti-ligand molecule(s) recognized by the 
binding moiety(s) comprised in said pIX protein. 

The term "and/or" whereever used herein includes the meaning of "and", "or" and "all 
5 or any other combination of the elements connected by said term". 

The term "about" or "approximately" as used herein means within 20%, preferably 
within 10%, and more preferably within 5% of a given value or range. 

The results disclosed in the present application on pIX functions and mapping of said 
functions on the adenoviral genome allow it to produce new virus and virus-like particles at 
10 least some of which can bind the target cell with high specificity and may deliver genetic 
material to the target cell; at least some of the viruses and virus-like particles may bind and 
deliver genetic material to the target cell, preferably without substantially binding to the 
natural host cell of the virus. 

The term "binding moiety" means a molecule that is exposed on the surface of the 
15 virus or virus-like particle which is able to bind to a molecule on the target cell. The "binding 
moiety" may be a molecule on the virus or virus-like particle that is modified in such a way 
that its binding specificity is changed, or it may be a molecule added to, and exposed on the 
surface of, the virus or virus-like particle to provide a new binding specificity. 

It is further preferred that the binding moiety is joined or fused to the virus or virus- 
20 like particles directly or indirectly by a spacer group. 

In the context of the present invention, the temi "adenovirus pIX protein comprising a 
binding moiety" means that the modified pIX protein of the invention is covalently bound to a 
binding moiety. Preferably, the covalent bond to the binding moiety is located within the N- 
terminal part or the C-terminal part, preferably the C-terminal leucine-repeat, of the pIX 
25 protein as defined further-below. 

In a preferred embodiment, the binding moiety is fused to the amino acid sequence of 
the protein, preferably within or to the N-terminal part or the C-terminal part, preferably the 
C-terminal leucine-repeat, of the pIX protein. In other words, this preferred embodiment 
means that the mutation of one or more amino acids of the pIX protein as mentioned above 
30 results in the presence of a binding moiety in the pIX protein. 
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It is particularly preferred that the binding moiety comprised by the pIX protein of the 
invention is capable to bind a target cell as it is described in detail further below. 

Any cell-binding protein or peptide or carbohydrate such as an oligosaccharide or lipid 
may be useful as a binding moiety, preferably for targeting the virus or virus-like particle to 
5 the cell, whereby polypeptides are preferred. For example, short linear stretches of amino 
acids, such as those constituting a peptide hormone may be useful, as may be domains of 
polypeptides that can fold independently into a structure that can bind to the target cell. 

In one preferred embodiment, the binding moiety may be a monoclonal antibody or 
binding fragment thereof, an ScFv (single chain Fv fragment), a dAb (single domain 
10 antibody) or a minimal recognition unit of an antibody. The binding site on the target cell may 
be a target cell-specific antigen. 

The binding moiety may be a monoclonal antibody. Monoclonal antibodies which will 
bind to many of these antigens are already known but in any case, with today's techniques in 
relation to monoclonal antibody technology, antibodies can be prepared to most antigens. The 

15 binding moiety may be a part of an antibody (for example a Fab fragment) or a synthetic 
antibody fragment (for example, ScFv). Suitable monoclonal antibodies to selected antigens 
may be prepared by known techniques, for example those disclosed in "Monoclonal 
Antibodies: A manual of techniques", H. Zola (CRC Press, 1988) and in "Monoclonal 
Hybridoma Antibodies: Techniques and Applications", J. G. R. Hurrell (CRC Press, 1982). 

20 Suitably prepared non-human antibodies can be "humanized" in known ways, for example by 
inserting the CDR regions of mouse antibodies into the framework of human antibodies. The 
variable heavy (V.sub.H) and variable light (V.sub.L) domains of the antibody are involved in 
antigen recognition, a fact first recognised by early protease digestion experiments. Further 
confirmation was found by "humanization" of rodent antibodies. 

25 That antigenic specificity is conferred by variable domains and is independent of the 

constant domains is known from experiments involving the bacterial expression of antibody 
fragments, all of which containing one or more variable domains. These molecules include 
Fab-like molecules (Better et al (1988) Science 240, 1041); Fv molecules (Skerra et al (1988) 
Science 240, 1038); ScFv molecules where the V(-j and V|_ partner domains are linked via a 

30 flexible oligopeptide (Bird et al (1988) Science 242, 423; Huston et al (1988) Proc. Natl. 
Acad. Sci. USA 85, 5879) and dAbs comprising isolated V domains (Ward et al (1989) 
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Nature 341, 544). A general review of the techniques involved in the synthesis of antibody 
fragments which retain their specific binding sites is to be found in Winter & Milstein (1991) 
Nature 349, 293-299. 

The term "ScFv molecules" refers to molecules wherein the Vjj and Vl partner 
5 domains are linked via a flexible oligopeptide. 

It may be advantageous to use antibody fragments, rather than whole antibodies. 
Effector functions of whole antibodies, such as complement binding, may be removed in such 
fragments. ScFv and dAb antibody fragments can be expressed as fusions with other 
polypeptides. 

10 Minimal recognition units may be derived from the sequence of one or more of the 

complementary-determining regions (CDR) of a Fv fragment. Whole antibodies, and F(ab ! )2 
fragments are "bivalent". By "bivalent", it is meant that said antibodies andF(ab , )2 fragments 
have two antigen combining sites. In contrast, Fab, Fv, ScFv, dAb fragments and minimal 
recognition units are monovalent, having only one antigen combining sites. 

15 In a further preferred embodiment, the binding moiety is at least part of a ligand of a 

target cell-specific cell-surface receptor. 

A particular cell-surface receptor can be present on a narrow class of cell types or a 
broader group encompassing several cell types. The present invention also encompasses the 
use of a binding moiety targeting cells within any organ or system, including for example, 

20 respiratory system (trachea, upper airways, lower airways, alveoli), nervous system and 
sensitory organs (e.g. skin, ear, nasal, tongue, eye), digestive system ( e.g. oral epithelium, 
salivary glands, stomach, small intestines, duodenum, colon, gall bladder, pancreas, rectum), 
muscular system (e.g. cardiac muscle, skeletal muscle, smooth muscle, connective tissue, 
tendons, etc), immune system (e.g. bone marrow, stem cells, spleen, thymus, lymphatic 

25 system, etc), circulatory system (e.g. muscles connective tissue, endothelia of the arteries, 
veins, capillaries, etc), reproductive sytem (e.g. testis, prostate, cervix, ovaries), urinary 
system (e.g. bladder, kidney, urethra), endocrine or exocrine glands (e.g. breast, adrenal 
glands, pituitary glands), etc. 

For example, binding moieties suitable for targeting liver cells include but are not 
30 limited to those derived from ApoB (apolipoprotein) able to bind to the LDL receptor, alpha- 
2-macroglobulin able to bind to the LPR receptor, alpha- 1 acid glycoprotein able to bind to 
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the asialoglycoprotein receptor and transferrin able to bind to the transferrin receptor. A 
binding moiety for targeting activated endothelial cells may be derived from the sialyl-Lewis- 
X antigen (able to bind to ELAM-1), from VLA-4 (able to bind to the VCAM-1 receptor) or 
from LFA-1 (able to bind to the ICAM-1 receptor). A binding moiety derived from CD34 is 
5 useful to target the hematopoietic progenitor cells through binding to the CD34 receptor. A 
binding moiety derived from ICAM-1 is more intended to target lymphocytes through binding 
to the LFA-1 receptor. The targeting of T-helper cells may use a binding moiety derived from 
HTV gp-120 or a class II MHC antigen capable of binding to the CD4 receptor. The targeting 
of neuronal, glial, or endothelial cells can be performed through the use of binding moieties 

10 directed for example to tissue-factor receptor (e.g. ELT-1, CD31, CD36, Cd34, CD105, 
CD13, ICAM-1 ; McCormick et aL, 1998, J. Biol. Chem. 273, 26323-26329), 
thrombomodulin receptor (Lupus et al., 1998, Suppl. 2 S120), VEGFR-3 (Lymboussaki et al., 
1998, Am. J. Pathol. 153, 395-403), VCAM-1 (Schwarzacher et al., 1996, Artherosclerosis 
122, 59-67) or other receptors. The targeting of blood clots can be made via fibrinogen or 

15 aHbb3 peptide. Finally, inflamed tissues can be targeted through selectins, VCAM-1, ICAM- 
1, etc. 

Moreover, suitable binding moieties also include linear stretches of amino acids, such 
as polylysine, polyarginine and the like recognized by integrins. Also, a binding moiety can 
comprise a commonly employed tag peptide (e.g. short amino acids sequences known to be 
20 recognized by available antisera), such as sequences from glutathione-S -transferase (GST) 
from Shistosoma manosi, thioredoxin beta galactosidase, or maltose binding protein (MPB) 
from E. coli, human alkaline phosphatase, the FLAG octapeptide or hemagluttinin (HA). 

It will be appreciated by those skilled in the art that binding moieties which are 
polypeptides may be conveniently made usin^ recombinant DNA techniques. The binding 
25 moiety may be fused to the pIX protein of the virus or virus-like particle or they may be 
synthesised independently of the virus or virus-like particle, by expression from a suitable 
vector in a suitable host and then joined to the virus or virus-like particle. 

Nucleic acid sequences encoding many of potentially useful targeting binding moieties 
are known, for example those for peptide hormones, growth factors, cytokines and the like 
30 and may be readily found by reference to publicly accessible nucleotide sequence databases 
such as EMBL and GenBank. Once the nucleotide sequence is known it is obvious to the 
person skilled in the art how to make DNA encoding the chosen binding moiety using, for 
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example, chemical DNA synthetic techniques or by using the polymerase chain reaction to 
amplify the required DNA from genomic DNA or from tissue-specific cDNA. 

Many cDNAs encoding peptide hormones, growth factors, cytokines and the like, all 
of which may be useful as binding moieties, are generally available from, for example British 
5 Biotechnology Ltd, Oxford, UK. 

It is preferred that, when a virus or virus-like particle comprising the plX protein of 
the invention binds to its target cell, it delivers its nucleic acid to said target cell, that is the 
target cell is infected by the virus or virus-like particle. Target cells, especially cancer cells, 
that are infected in this manner by the virus or virus-like particle may express viral molecules 
10 on their surface and may be recognised by the immune system and destroyed. Of course, other 
cytotoxic functions of the virus may also kill the cell. 

Targeting can be achieved by first identifying a suitable address at the cellular surface 
and then constructing a virus or virus-like particle which comprises a plX protein comprising 
a binding moiety that they can recognize this address. For the identification of suitable target 
15 cell-specific addresses and molecules capable of binding these addresses suitable methods are 
described in the art. 

It has for instance been shown that a cell type or a disease-affected cell expresses 
unique cell surface markers. For example, endothelial cells in rapidely growing tumors 
express cell surface proteins not present in quiescent endothelium, .e.g. ccVv integrins (Brooks 

20 et al. Science 264 (1994) 569) and receptors for certain angiogenic growth factors (Hanahan 
Science 277 (1997) 48). Phage display library selection methods can be employed to select 
peptide sequences that interact with these particular cell surface markers (see for example US 
5,622 699 US 5,223,409 and US 5,403,484). In this system, a random peptide is expressed on 
the phage surface by fusion of the corresponding coding sequence to a gene encoding one of 

25 the phage surface proteins. The desired phages are selected on the basis of their binding to the 
target such as isolated organ fragments (ex vivo procedure) or cultured cells (in vitro 
procedure). Identification of targeting peptides can also be done by an in vivo procedure that 
involves injecting phage libraries into mice and subsequently rescuing the bound phages from 
the targeted organs. Selected peptides are identified by sequencing the region of the phage 

30 genome encoding the displayed peptide. In vivo organ screening was successfully applied to 
isolate peptide sequences that conferred selective phage homing to the brain and kidney 
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(Pasqualini et al., Nature 380 (1996) 364-366), to the vasculature of lung, skin and pancreas 
(Rajotte et al., J. Clin. Invest. 102 (1998) 430-437) and to several tumor types (Pasqualini et 
al., Nature Biotechnology 15 (1997) 542-546). 

Furthermore, tumors could be targeted not only via their vasculature but also via the 
extracellular matrix (ECM) or the tumor cells themselves. Since blood vessels are constantly 
modified in tumors, the endothelium is locally disrupted allowing gene therapy vectors to 
extravasate and interact with the ECM and tumor cells. Peptides which interact with the ECM 
or tumor-associated cell surface markers could also be selected using the phage display 
technique (Christiano et al. Cancer Gene Therapy 3 (1996) 4-10 ; Cfoce et al. Anticancer Res. 
17 (1997) 4287-4292 ; Gottschalk et al. Gene Then 1 (1994) 185-191 ; Park et al. Adv 
Pharmacol. 40 (1997) 399-435). As an example, a HWGF motif was identified as a ligand of 
the matrix metalloproteinases involved in tumor growth, angiogenesis and metastasis. 
Administration of a HWGF-comprising peptide to a tumour-bearing animal model prevents 
tumor growth and invasion and prolongs animal survival (Koivunen et al. Nature 
Biotechnology 17 (1999) 768-774). 

Recently, Romanczuk et al. (Human Gene Therapy 10 (1999) 2615-2626) reported the 
isolation of peptides targeting the differentiated, cilliated airway epithelial cells. Coupling of 
the best binding peptide to the surface of a recombinant adenovirus with Afunctional 
polyethylene glycol (PEG) resulted in a vector able to transduce the target cells via an 
alternative pathway dependent on the incorporated peptide. 

In cases where the binding moiety is not part of a fusion protein with the pIX protein, 
the binding moiety and the pIX protein may be linked together by any of the conventional 
ways of cross-linking polypeptides, such as those generally described in O'Sullivan et al 
(Anal. Biochem. (1979) 100, 100-108). For example, the binding moiety may be enriched 
with thiol groups and the molecule on the surface of the virus or virus-like particle, i.e. the 
pIX protein, may be reacted with a bifunctional agent capable of reacting with those thiol 
groups, for example with the N-hydroxysuccinimide ester of iodoacetic acid (NHIA) or N- 
succinimidyl-3-(2-pyridyldithio)propionate (SPDP). Amide and thioether bonds, for example 
achieved with m-maleimidobenzoyl-N-hydroxysuccinimide ester, are generally more stable in 
vivo than disulphide bonds. 

Other chemical procedures may be useful in joining oligosaccharide and lipids to 
polypeptides. 
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Covalent coupling between the binding moiety and the pIX protein may also be 
performed using a polymer such as polyethylene glycol (PEG) or its derivatives (see for 
example WO99/40214 ; Bioconjugate Techniques, 1996, 606-618 ; ed G Hermanson ; 
Academic Press and Frisch et al., 1996, Bioconjugate Chem. 7, 180-186). The binding moiety 
5 and the pIX protein may also be non-covalently coupled, for example via electrostatic 
interactions or through the use of affinity components such as protein A, biotin/avidin, which 
are able to associate both partners. Immunological coupling can also be used in the context of 
the present invention, for example using antibodies to conjugate the selected binding moiety 
to the pEX protein. For example, it is possible to use biotinylated antibodies directed to a 
10 surface-exposed pIX epitope and streptavidin-labelled antibodies directed against the selected 
binding moiety according to the technique disclosed by Roux et al. (1989, Proc. Natl. Acad 
Sci USA 86, 9079). Bifunctional antibodies directed against each of the coupling partners are 
also suitable for this purpose. 

It is preferred that the binding moiety is a polypeptide. Preferably, the binding moiety 
15 is joined to the pDC protein in that both polypeptides are produced as a fusion by techniques 
of genetic engineering. The use of genetic engineering allows for the precise control over the 
fusion of such polypeptides. Thus, in a preferred embodiment, the modified pIX protein of the 
invention is a fusion between a binding moiety and the unmodified pIX protein, 
advantageously, the binding moiety is fused to the N-terminus of the pIX protein. Even more 
20 preferably, the binding moiety is fused within or to the C-terminal part of the pIX protein. 
Generally, the fusion site is selected in such a way to lead to maximal presentation of the 
binding moiety to its corresponding cell-surface partner, and/or to not disturb the interaction 
with the other capsid viral proteins that are known to interact with pIX (e.g. fiber, penton base 
and/or hexcn). 

25 More precisely, the binding moiety can be fused to the C-terminus or between two 

residues located within the C-terminal part of the pIX protein, or still in replacement of one or 
more residues located within the C-terminal part of the pIX protein. The first embodiment can 
be illustrated by the fusion of the binding moiety sequence just upstream of the stop codon. 
Another alternative could be the mutation of die stop codon in an amino acid encoding codon, 

30 so that the pEX polyA sequence that follows the stop codon will be translated in a stretch of 
several lysines (a binding moiety able to recognize cell-surface integrins). The second 
embodiment includes insertion of a binding moiety-encoding sequence between two codons 
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of the pIX protein, eventually through insertion of a restriction site (preferably a 6 nt length 
restriction site) in the pIX sequence so that the restriction site encodes one or more codons 
located between two residues of the wild type pIX. Such a restriction site can be used 
conveniently to fuse the binding moiety-encoding sequence. A convenient illustration of such 
5 a second embodiment can be provided by the insertion of a BamHI site between the codons 
encoding the leucine residue in position 131 and the lysine residue in position 132 of the pIX 
protein. An illustration of the third embodiment is the insertion of the binding moiety in 
replacement of the residues 128 to 140 of the wild-type pIX protein. This results in a 
truncated pIX protein connected at residue 127 with the binding moiety. 

L0 Preferably, the fusion is made in frame and does not disrupt the pIX open reading 

frame. Eventually, the binding moiety and the pIX protein can be connected through the use 
of one or more spacers, e.g. a first spacer at the N-terminus of the binding moiety and 
optionally a second spacer at the C- terminus of the binding moiety. The spacer is preferably 
made up of amino acid residues with high degrees of freedom of rotation, which permit the 

L5 binding moiety to adopt a conformation that is recognized by its corresponding cell-surface 
partner. Preferred amino acid residues for the spacer are alanine, glycine, proline and/or 
serine. In a specific embodiment, the spacer is a peptide comprising the sequence Ser-Ala, 
Gly-Ser, Pro-Ser-Ala or Pro-Gly-Ser or a repetition thereof. By way of illustration, the 
sequence Gly-Ser-(Ser-Ala)4-Ser is suitable for such a use. 

20 Accordingly, the invention relates to a nucleotide sequence encoding this fusion of the 

binding moiety and the pIX protein of the vims or virus-like particle. 

The nucleotide sequence encoding the fusion of the binding moiety and the pIX 
protein of the virus or virus-like particle is preferably made by an alteration of the viral 
genome. 

15 The nucleotide sequence may be synthesised de novo using solid phase 

phosphoramidite chemistry, but it is more usual for the nucleotide sequence to be constructed 
from two parts, the first encoding the binding moiety and the second the pIX protein of the 
virus or virus-like particle. The two parts may be derived from their respective genes by 
restriction endonuclease digestion or by other methods known by those skilled in the art such 

30 as by polymerase chain reaction. 
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A variety of methods have been developed to operatively link two nucleotide 
sequences via complementary cohesive termini. For instance, synthetic linkers containing one 
or more restriction sites provide a method of joining the two DNA segment together. Each 
DNA segment, generated by endonuclease restriction digestion, may be treated with 
bacteriophage T4 DNA polymerase or E. coli DNA polymerase I, enzymes that remove 
protruding, 3 -single-stranded termini with their 3'-5-exonucleolytic activities, and fill in 
recessed 3 -ends with their polymerizing activities. 

The combination of these activities therefore generates blunt-ended DNA segments. 
The blunt-ended segments are then incubated with a large molar excess of linker molecules in 
the presence of an enzyme that is able to catalyze the ligation of blunt-ended DNA molecules, 
such as bacteriophage T4 DNA ligase. Thus, the products of the reaction are DNA segments 
carrying polymeric linker sequences at their ends. These DNA segments arc then cleaved with 
the appropriate restriction enzyme and lifted to an expression vector that has been cleaved 
with an enzyme that produces termini compatible with those of the DNA segment. 

Synthetic linkers containing a variety of restriction endonuclease sites are 
commercially available from a number of sources including International Biotechnologies Inc, 
New Haven, Conn., USA. A desirable way to generate the DNA encoding the fusion 
polypeptide of the invention is to use the polymerase chain reaction as disclosed by Saiki et al 
(1988) Science 239, 487-491. 

In this method each of the DNA molecules encoding the two polypeptides to be fused 
are enzymatically amplified using two specific oligonucleotide primers which themselves 
become incorporated into the amplified DNA. The said specific primers may contain 
restriction endonuclease recognition sites which may then be used to join the said two DNA 
molecules using T4 DNA ligase as disclosed. 

In a preferred embodiment, the invention concerns an adenovirus pIX protein modified 
by mutation of one or more amino acids of said pIX protein ais compared to the wild-type pIX 
protein, wherein said amino acids are selected in the N-terminal part of the protein or in the 
C-terminal part, preferably the C-terminal leucine-repeat of the protein. Preferably, the 
adenovirus pIX protein of the invention comprises a binding moiety at the C-terminal part of 
the protein, preferably at the C-terminal leucine -repeat. In an advantageous embodiment, 
said binding moiety is able to bind to a target cell. 
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The term "N-terminal part" as used herein refers to the portion of the pIX protein 
extending from about amino acid residue in position 1 to about amino acid residue in position 
39. With respect to Ad5 pIX protein, it refers more precisely to the portion of the pIX protein 
5 extending from about the methionine residue in position 1 to about the valine residue in 
position 39, as shown in SEQ ID NO: 1. 

The term "C-terminal part" as used herein refers to the portion of the pIX protein 
extending from about the first amino acid residue of the leucine repeat domain to about the 
last amino acid residue of the pIX protein. With respect to Ad5 pIX protein, it refers more 
10 precisely to the portion of the pIX protein extending from about the leucine residue in 
position 100 to about the valine residue in position 140, as shown in SEQ ID NO: 1. 

In another preferred embodiment, the invention concerns an adenovirus pIX protein 
modified by mutation of one or more amino acids of said pIX protein as compared to the 
wild-type pIX protein, wherein said amino acids are selected in the C-terminal leucine-repeat 
15 of the protein. Preferably, said modified pIX protein does not have a rigid helix structure. 

The term " C-terminal leucine-repeat " as used herein refers to the portion of the pIX 
protein rich in hydrophobic amino acid residues (e.g. leucine and/or valine). Preferably, the C- 
terminal leucine-repeat contains at least the sequence "(LXXLXXXLXX)n" where X is any 
20 amino acid residue and n is between 1 to 4. With respect to Ad5 pIX protein, it refers more 
precisely to the portion of the pIX protein extending from about the leucine residue in 
position 100 to about the leucine residue in position 121, as shown in SEQ ID NO: 1. 

Preferably, the mutation within the C-terminal leucine-repeat is aimed to destabilize 
the helix structure provided by said C-terminal leucine-repeat in the wild-type pIX protein. 

25 Such a mutation can affect one or more residue(s) involved in the helix structure, with a 
special preference for the hydrophobic residues such as leucine and/or valine. More precisely, 
such a mutation can be a deletion of all or part of the Leucine repeat domain or a mutation 
affecting one or more residues selected from the group consisting of the leucine in position 
100, the leucine in position 103, the leucine in position 104, the glutamine in position 106, the 

30 leucine in position 107, the leucine in position 110, the glutamic acid in position 113, the 
leucine in position 114, the valine in position 117, the leucine in position 121, of the wild type 
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Ad5 pIX protein (SEQ ID NO : 1). Even more preferably, the mutation is a substitution 
mutation of one or more residues corresponding to residues 100, 103, 104, 106, 107, 110, 113, 
114, 117 or 121 of the wild type Ad5 pIX protein (SEQ ID NO : 1). Preferred mutations 
involve the substitution of the aforementioned residues with a proline or a charged residue 
5 and most preferably : 

- the substitution of the leucine in position 1 14 by a proline (LI 14P), 

- the substitution of the valine in position 1 17 by an aspartic acid (V117D), or 

- the substitution of the leucine in position 114 by a proline and the substitution of 
the valine in position 117 by an aspartic acid (L114P-V117D or L-V). 

10 According to a preferred embodiment, such a mutated pIX protein is also modified so 

as to comprise a binding moiety. As described above, the binding moiety is preferably 
inserted within the C-terminal part of the mutated pIX protein, preferably after the leucine 
repeat (e.g. at the C-terminus, between residues 131 and 132 or after residue 127). 

15 In a further aspect, the invention relates to nucleic acid molecules comprising a 

nucleotide sequence encoding the adenovirus pIX protein of the invention as it is described 
herein above. 

Within the context of the present invention, the term "nucleic acid molecule" defines a 
20 polymeric form of any length of deoxyribonucleotides (DNA) or ribonucleotides (RNA). The 
nucleic acid molecule can be single or double-stranded, linear or circular. It is preferably a 
double-stranded DNA molecule. It may also comprise modified nucleotides, such as 
methylated nucleotides or nucleotide analogs (see US 5,525,711, US 4,711,955 or EPA 302 
175 as examples of modifications). If present, modifications to the nucleotide structure may 
25 be imparted before or after assembly of the polymer (such as by conjugation with a labeling 
component). The sequence of nucleotides may also be interrupted by nori-nucleotide 
elements. The nucleic acid molecule of the present invention can code for a full length 
modified pIX protein or for a fragment thereof (e.g. restriction endonuclease-generated and 
PCR-generated fragments that can be obtained therefrom). The present invention also 
30 encompasses synthetic fragments (e.g. produced by oligonucleotide synthesis). 

The nucleic acid molecule of the present invention is preferably a vector for cloning or 
expressing such modified pIX protein. Any type of vector can be used in the context of the 
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present invention, whether of plasmid or viral, integrating or nonintegrating origin. Such 
vectors are commercially available or described in the literature. Similarly, those skilled in 
the art are capable of adjusting the regulatory elements required for the expression of the 
DNA fragment of the invention. Preferably , said vector is an adenoviral vector capable of 
5 producing under suitable culturing conditions, virus or virus-like particles bearing at their 
surface a modified pIX protein according to the present invention (as described hereinafter). 

In a preferred embodiment, the nucleotide sequences comprised by the nucleic acid 
molecule of the invention encodes a fusion of the binding moiety with the native pIX protein 
10 or with a mutated pIX protein as described above. 

In a further aspect, the present invention relates to an adenoviral vector which 
comprises the nucleic acid molecule of the invention. The terms "adenoviral genome" and 
"adenoviral vector" are synonyms and generally refer to the genetic material contained in a 
virus or virus-like particle, preferably an adenovirus. More specifically, these terms designate 

15 a nucleic acid sequence of adenoviral origin comprising at least adenoviral ITR 5', ITR 3' and 
encapsidation (psi) or "packaging" sequence able to promote packaging of said adenoviral 
genome into an adenoviral particle in order to produce an adenovirus (or virion). Said genome 
or vector can further comprise all or part of the Ela, Elb, E2a, E2b, E3, E4 adenoviral 
regions. An adenoviral vector according to the present invention is derived from the genome 

20 of a natural or wild-type adenovirus, advantageously a canine, avian or human adenovirus, 
preferably a human adenovirus type 2, 3, 4, 5 or 7 and, as an absolute preference, a human 
adenovirus type 5 (Ad5). Li this latter case, the deletions of the adenoviral vector according to 
the invention are indicated by reference to the position of the nucleotides of the Ad5 genome 
which is specified in the GenBank data bank under the reference M73260 (see Figure 6 and 

25 SEQIDNO:l). 

An adenoviral vector according to the invention is preferably defective for replication, 
but capable of being replicated and encapsidated in a complementation cell which provides 
the vector in trans with the product(s) for which it is defective so as to generate an adenoviral 
particle comprising an adenoviral genome (also termed defective adenovirus) which is 
30 incapable of autonomous replication in a host cell but nevertheless infectious, since it has the 
capacity to deliver the vector to a target cell. However, an adenovirus vector according to the 
invention can further relate to replication competent vectors (i.e. not defective for replication ; 
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Alemany et al., 2000, Nature Biotechnology, 18, 723-727), as well as to conditionally 
replicative adenoviral vectors (CRAd ; Heise and Kirn, 2000, J. of Clinical Investigation, 105, 
847-851). 

Preferably, the nucleic acid molecule of the invention is placed in the adenoviral 
5 genome vector in replacement of the wild-type pIX-encoding gene, using the native pIX 
promoter to drive expression of said nucleic acid molecule. It is also possible to inactivate the 
wild-type pIX-encoding gene (e.g. by deletion or mutation) and to insert the nucleic acid 
molecule of the invention at another (non-native) location in the adenoviral region, either 
under the control of the native pIX promoter or under the control of an heterologous promoter 
10 (e.g. an inducible or constitutive promoter) to make said nucleic acid molecule expressed 
when desired (e.g. in an appropriate cell during the process of preparation of a virus or virus- 
like particle, such as the 293 or PER-C6 cell line). The promoters described in connection 
with the gene of interest (or exogenous sequence) are suitable in this context. 

Furthermore, the adenoviral vector of the present invention can be further modified 
15 especially to reduce or abolish interaction with the cellular receptors which normally mediate 
virus attachment and/or entry in the target cells (e.g. interaction between the fiber and the 
CAR receptor, between penton base and integrins and the like) 

In this respect, it is known that the host specificity of Ad2 and Ad5 is different from 
that of Ad3 and Ad7 with respect to CAR-mediated pathway. Thus, it would be advantageous 

20 to replace one or more residues of an Ad5 or Ad2 fiber involved in CAR-binding with 
equivalent residues originating from an equivalent region of an Ad3 or Ad7 fiber, so as to 
decrease the ability of said fiber to bind the CAR receptor. By way of illustration, suitable 
CAR-ablating mutations include those described in W098/44121, WO01/16344, 
WO/0138361 and WOOO/15823 as well as in Kirby et al. (2000, J. Virol. 74, 2804-2813) and 

25 Leissner et al. (2001, Gene Ther. 8, 49-57), with a special preference for the substitution of 
the serine in position 408 of the Ad5 fiber by a glutamic residue. 

The adenoviral fiber can be further modified for example in the shaft region to provide 
a « short shafted » fiber, for example as described in US patent 5,962,311. By "short-shafted 
fiber" a fiber is meant whose shaft is shorter than that which is present in a given naturally 

30 occurring, i.e., wild-type, adenovirus. For example, a shaft is shorter than that which is 
present in Ad2 or Ad5. The shaft can be shortened by replacement of a longer fiber with a 
shorter fiber, which may be of a different serotype. In this regard, the fiber shaft and knob can 
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be of the same serotype or the shaft can be of one serotype and the knob can be of another 
serotype. For instance, an Ad9 fiber shaft can be used with an Ad2 or Ad5 knob. Preferably, 
however, the shaft is shortened by deletion of a portion of the shaft, preferably a complete 
repeat distal to the tail. Preferably, the shaft comprises at least about six repeats, more 
preferably from about six to about twelve repeats. 

In the context of the present invention, an adenoviral vector has as its objective the 
transfer of an exogenous nucleotide sequence (or gene of insterest) to a target cell and its 
expression therein. "Exogenous nucleotide sequence" is understood to mean a nucleic acid 
which comprises at least one coding sequence and, preferably also, regulatory sequences 
permitting the expression of said coding sequence(s). The exogenous nucleotide sequence and 
preferably the coding sequence(s) comprised by it are sequences which are normally not 
present in the genome of an adenovirus. The exogenous nucleotide sequence may be 
introduced into an adenoviral vector according to the invention by standard techniques of 
genetic engineering (see e.g. Sambrook and Russell (2001), Molecular Cloning, CSH Press, 
NY, USA), preferably by homologous recombination as disclosed in EP 742 834, 
preferentially between the encapsidation region and the 3' ITR. Such an adenoviral vector 
comprising an exogenous nucleotide sequence is called "recombinant" as opposed to the 
"wild type" adenoviral vector or corresponding adenoviral vectors which are modified as 
described above, but do not contain an exogenous nucleotide sequence. 

The term "gene" refers to a nucleic acid comprising a coding sequence that may 
contain introns, or a fragment thereof, or a cDNA,or a fragment thereof. 

Preferably, the exogenous nucleotide sequence comprises a gene suitable for gene 
therapy, i.e. is therapeutically useful. 

Among genes of interest which are preferably usable in the context of the present 
invention, there may be mentioned: 

the genes coding for cytokines such as interferon alpha, interferon gamma, 
beta-interferon, interleukins; 

the genes coding for membrane receptors such as the receptors recognized by 
pathogenic organisms (viruses, bacteria or parasites), preferably by the HIV virus (human 
immunodeficiency virus); 

the genes coding for coagulation factors such as factor Vm and factor IX; 
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the gene coding for dystrophin; 
the gene coding for insulin; 

the genes coding for proteins participating directly or indirectly in cellular ion 
channels, such as the CFTR (cystic fibrosis transmembrane conductance regulator) protein; 

the genes coding for antisense RNAs or proteins capable of inhibiting the 
activity of a protein produced by a pathogenic gene, present in the genome of a pathogenic 
organism, or by a cellular gene, the expression of which is deregulated, for example an 
oncogene; 

the genes coding for a protein inhibiting an enzyme activity, such as alphal - 
antitripsin or a viral protease inhibitor; 

the genes coding for variants of pathogenic proteins which have been mutated 
so as to impair their biological function, such as, for example, trans-dominant variants of the 
TAT protein of the HIV virus which are capable of competing with the natural protein for 
binding to the target sequence, thereby preventing the activation of HIV; 

the genes coding for antigenic epitopes in order to increase the host cell's 

immunity; 

the genes coding for major histocompatibility complex classes I and II proteins, 
as well as the genes coding for the proteins which are inducers of these genes; 

the genes coding for cellular enzymes or those produced by pathogenic 
organisms; and 

suicide genes. The TK-HSV-1 suicide gene may be mentioned more especially. 
The viral TK enzyme displays markedly greater affinity compared to the cellular TK enzyme 
for certain nucleoside analogues (such as acyclovir or gancyclovir). It converts them to 
monophosphated molecules, which can themselves be converted by the cellular enzymes to 
nucleotide precursors, which are toxic. These nucleotide analogues can be incorporated in 
DNA molecules undergoing synthesis, hence chiefly in the DNA of cells in a state of 
replication. This incorporation enables dividing cells such as cancer cells to be destroyed 
specifically. 

This list is not restrictive, and other genes of interest may be used in the context of the 
present invention. 
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In one embodiment, the gene is a suicide gene encoding a molecule having a directly 
or indirectly cytotoxic function. By "directly or indirectly" cytotoxic, it is meant that the 
molecule encoded by the gene may itself be toxic (for example ricin; tumour necrosis factor; 
interleukin-2; interferon-gamma; ribonuclease; deoxyribonuclease; Pseudomonas exotoxin A) 
or it may be metabolised to form a toxic product, or it may act on something else to form a 
toxic product. The sequence of ricin cDNA is disclosed in Lamb et al (1985) Eur. J. Biochem. 
148, 265-270 incorporated herein by reference. 

For example, it may be desirable to target an exogenous DNA sequence encoding an 
enzyme using the adenoviral vector of the invention or a virus or virus-like particle containing 
it, the enzyme being one that converts a relatively non-toxic prodrug to a toxic drug. The 
enzyme cytosine deaminase converts 5-fluorocytosine (5FC) to 5-fluorouracil (5FU) (Mullen 
et al (1922) PNAS 89, 33); the herpes simplex enzyme thymidine kinase sensitises cells to 
treatment with the antiviral agent ganciclovir (GCV) or aciclovir (Moolten (1986) Cancer 
Res. 46, 5276; Ezzedine et al (1991) New Biol 3, 608). The cytosine deaminase of any 
organism, for example E. coli (EP 402 108) or Saccharomyces cerevisiae (Erbs et al., 1997, 
Curr. Gennet. 31, 1-6 ; WO93/01281), may be used. 

Thus, in a preferred embodiment of the invention, the gene encodes a cytosine 
deaminase. In accordance with this embodiment, the patient may be concomitantly given 5FC 
and a virus or virus-like particle expressing cytokine deaminase. By "concomitantly", it is 
meant that 5FC is administered at such a time, in relation to the transformation of the target 
cells, such as tumour cells, that 5FC is converted into 5FU in the target cells by the cytosine 
deaminase expressed from the said gene. A dosage of approximately 0.001 to 100.0 mg 5C/kg 
body weight/day, preferably 0.1 to 10.0 mg/kg/day is suitable. 

The use of a combination of suicide gene products, e.g. cytosine deaminase and uracil 
phosphoribosyl transferase activity can also be envisaged in the context of the invention. 
Suitable genes encoding uracil phosphoribosyl transferase include those from E. coli 
(Anderson et al., 1992, Eur. J. Biochem. 204, 51-56) and Saccharomyces cerevisiae (Kern et 
al., 1990, Gene 88, 149-157). Advantageously, the exogenous DNA sequence in use in the 
present invention encodes a polypeptide having both cytosine deaminase and uracil 
phosphoribosyl transferase activities. Cytosine deaminase deaminates the 5-FC analog, therby 
forming 5-fluorouracil (5-FU), which is highly cytotoxic when it is converted into 5-fluoro- 
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UMP by uracil phosphoribosy! transferase action. Such a polypeptide is described for 
example in W096/16183 and W099/5448L 

Components, such as 5FC, which are converted from a relatively non-toxic form into a 
cytotoxic form by the action of an enzyme are termed "pro-drugs". 

5 Other examples of pro-drug/enzyme combinations include those disclosed by 

Bagshawe et al (WO88/07378), namely various alkylating agents and the Pseudomonas spp. 
CPG2 enzyme, and those disclosed by Epenetos & Rowlinspn-Busza (WO 91/11201), namely 
cyanogenic pro-drugs (for example amygdalin) and plant-derived beta-glucosidases. 

Enzymes that are useful in this embodiment of the invention include, but are not 
10 limited to, alkaline phosphatase useful for converting phosphate-containing prodrugs into free 
drugs; arylsulfatase useful for converting sulfate-containing prodrugs into free drugs; cytosine 
deaminase useful for converting non-toxic 5-fluorocytosine into the anti-cancer drug, 5- 
fluorouracil; proteases, such as serratia protease, thermolysin, subtilisin, carboxypeptidases 
and cathepsins (such as cathepsins B and L), that are useful for converting peptide-containing 
15 prodrugs into free drugs; D-alanylcarboxypeptidases, useful for converting prodrugs that 
contain D-amino acid substituents; carbohydrate-cleaving enzymes such as beta-galactosidase 
and neuraminidase useful for converting glycosylated prodrugs into free drugs; beta- 
lactamase useful for converting drugs derivatized with beta-lactams into free drugs; and 
penicillin amidases, such as penicillin V amidase or penicillin G amidase, useful for 
20 converting drugs derivatiized at their amine nitrogens with phenoxyacetyl or phenylacetyl 
groups, respectively, into free drugs. Alternatively, antibodies with enzymatic activity, also 
known in the art as abzymes, can be used to convert the prodrugs of the invention into free 
active drugs (see, e.g. R. J. Massey, Nature, 328, pp. 457-458 (1987)). 

Similarly, the prodrugs include, but are not limited to, the above-listed prodrugs, e.g., 
25 phosphate-containing prodrugs, thiophosphate-containing prodrugs, sulfate-containing 
prodrugs,peptide-containing prodrugs, D-amino acid-modified prodrugs, glycosylated 
prodrugs, beta-lactam-containing prodrugs, optionally substituted phenoxyacetamide- 
containing prodrugs or optionally substituted phenylacetamide-containing prodrugs, 5- 
fluorocytosine and other 5-fluorouridine prodrugs which can be converted by the enzyme 
30 from the conjugate into the more active, cytotoxic free drug. Examples of cytotoxic drugs that 
can be derivatized into a prodrug form for use in this invention include, but are not limited to, 
etoposide, teniposide, adriamycin, daunomycin, carminomycin, aminopterin, dactinomycin, 
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mitomycins, cis-platinum and cis-platinum analogues, bleomycins, esperamicins (see U.S. 
Pat. No. 4,675,187), 5-fluorouracil, melphalan and other related nitrogen mustards. 

In a further embodiment, the gene delivered to the target cell encodes a ribozyme 
capable of cleaving targeted RNA or DNA. The targeted RNA or DNA to be cleaved may be 
5 RNA or DNA which is essential to the function of the cell and cleavage thereof results in cell 
death or the RNA or DNA to be cleaved may be RNA or DNA which encodes an undesirable 
protein, for example an oncogene product, and cleavage of this RNA or DNA may prevent the 
cell from becoming cancerous. 

In a still further embodiment, the gene delivered to the target cell encodes an antisense 

10 RNA. 

The term "antisense RNA" means an RNA molecule which hybridises to, and 
interferes with the expression of an mRNA molecule encoding a protein or to another RNA 
molecule within the cell such as pre-mRNA or tRNA or rRNA, or hybridises to, and interferes 
with the expression of a gene. 

15 Furthermore, a gene delivered to the target cell may also encode other species of RNA 

molecules that are capable of influencing gene expression of the cell such as RNA molecules 
that may exert an RNA interference (RNAi) or a co-suppression effect. 

Conveniently, a gene expressing an antisense RNA may be constructed by inserting a 
coding sequence encoding a protein adjacent to a promoter in the appropriate orientation such 
20 that the transcribed RNA is complementary to the target mRNA. Suitably, the antisense RNA 
blocks expression of undesirable polypeptides such as oncogenes, for example ras, bcl, src or 
tumour suppressor genes such as p53 and Rb. 

It will be further appreciated that DNA sequences suitable for being expressed as an 
antisense RNA may be readily derived from publicly accessible databases such as GenBank 
25 and EMBL. 

In another embodiment of the invention, the gene may replace the function of a 
defective gene in the target cell. There are several thousand inherited genetic diseases of 
mammals, including humans, that are caused by defective genes. Examples of such genetic 
diseases include cystic fibrosis, which is known to be caused by a mutation in the CFTR gene; 
30 Duchenne muscular dystrophy, which is known to be caused by a mutation in the dystrophin 
gene; sickle cell disease, which is known to be caused by a mutation in the HbA gene. Many 
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types of cancer are caused by defective genes, especially protooricogenes, and tumour- 
suppressor genes that have undergone mutation. 

Thus, it is preferred that an adenoviral vector of the invention or a virus or virus-like 
particle containing it, which may be useful in the treatment of cystic fibrosis, contains a 
5 functional CFTR gene to replace the function of the defective CFTR gene. Similarly, it is 
preferred that an adenoviral vector of the invention, which may be useful in the treatment of 
cancer, contains a functional protooncogene or tumour-suppressor gene to replace the 
function of the defective protooncogene or tumour-suppressor gene. 

Examples of protooncogenes are ras, src, bcl and so on; examples of tumour- 
10 suppressor genes are p53 and Rb. 

It will be appreciated that the gene will be introduced into a convenient place within 
the adenoviral vector and will contain a promoter and/or enhancer element to drive its 
expression. 

It is preferred that the promoter and/or enhancer element is selective for the cells to be 
15 targeted. Some examples of tissue or tumour specific promoters are given below but new ones 
are being discovered all of the time which will be useful in this embodiment of the invention. 

For example, the mucin gene, MUC1, contains 5' flanking sequences which are able to 
direct expression selectively to breast and pancreatic cell lines, but not to non-epithelial cell 
lines as it is taught in WO 91/09867. 

20 In the context of the present invention, an exogenous nucleotide sequence can consist 

of one or more gene(s) of interest, and preferably is of therapeutic interest. In the context of 
the present invention, a gene of interest can for example code for an antisense RNA, or for an 
mRNA which will then be translated into a protein of interest. A gene of interest can be of 
genomic type, of complementary DNA (cDNA) type or of mixed type (e.g. a minigene, in 

25 which at least one intron is deleted). It can code for a mature protein, a precursor of a mature 
protein, in particular a precursor intended to be secreted and accordingly comprises a 
corresponding signal peptide, a chimeric protein originating from the fusion of sequences of 
diverse origins, or a mutant of a natural protein displaying improved or modified biological 
properties. Such a mutant may be obtained by mutation, deletion, substitution and/or addition 

30 of one or more nucleotide(s) of the gene coding for the natural protein. Other genes of interest 
may for instance encode a ribozyme or an RNA interference (RNAi) construct, capable of 
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inliibiting gene expression of an endogenous gene in the transfected cell. Means and methods 
for constructing ribozymes and RNAi constructs are known to the person skilled in the art and 
are well documented in the pertinent literature. 

A gene of interest may be placed under the control of elements suitable for its 
5 expression the target cell. "Suitable elements" are understood to mean the set of elements 
needed for the transcription of the gene of interest into RNA (e.g. antisense RNA or mRNA) 
and for the translation of an mRNA into protein. Among the elements needed for 
transcription, the promoter assumes special importance. It can be a constitutive promoter or a 
regulatable promoter, and can be isolated from any gene of eukaryotic or viral origin, and 

10 even adenoviral origin. Alternatively, it can be the natural promoter of the gene of interest in 
question. Generally speaking, a promoter used in the present invention may be modified so as 
to contain regulatory sequences. As examples, a gene of interest in use in the present 
invention may be placed under the control of the promoter of one of the immunoglobulin 
genes when it is desired to target its transfer to lymphocytic host cells. There may also be 

15 mentioned the TK-HSV-1 (herpesvirus, type 1 thymidine kinase) gene promoter or 
alternatively the adenoviral MLP promoter, in particular from human adenovirus type 2, 
permitting expression in a large number of cell types. 

In a preferred embodiment, the transcriptional control sequence comprises a promoter 
element. Preferably, one would use a high expression promoter. Said promoter may be for 

20 example selected from the group consisting of viral promoters and tissue- or celltype-specific 
promoters such as muscle specific promoters, or a combination thereof. Examples of such 
viral promoters are the SV40 early and late promoters, the adenovirus major late promoter, 
the Rous Sarcoma Virus (RSV) promoter, the Cytomegalovirus (CMV) immediate-early 
promoter, the herpes simplex virus (HSV) promoter, the MPSV promoter, the 7.5k promoter, 

25 the vaccinia promoter and the Major-intermediate-early (MIE) promoter. Examples of muscle 
specific promoters are the smooth muscle 22 (SM22) promoter, die myosin light chain 
promoter, the myosin heavy chain promoter, the skeletal alpha-actin promoter and the 
dystrophin promoter. The Cytomegalovirus (CMV) immediate-early promoter is however 
preferred. The natural promoter of the beta-interferon encoding sequence may also be used 

30 (US 4,738,931). The polynucleotide sequence of the promoter can be a naturally occurring 
promoter sequence isolated from biological nucleic acid material or chemically synthesized. 
The promoter sequence can also be artificially constructed by assembling elements previously 
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screened for transcriptional activity leading to potencies which can exceed those of naturally 
occurring ones (Li et al., 1999, Nature Biotech., 17, 241-245). The expression cassette 
(comprising coding sequence and promoter) can be constructed using routine cloning 
techniques known to persons skilled in the art (for example, see Sambrook et al., 2001, supra). 
In still another aspect of the invention, the transcriptional control sequence further comprises 
at least one enhancer element. The term " enhancer" refers to a regulatory element which 
activates transcription in a position and orientation independent way. Several enhancer 
elements have been identified to date in many genes. For example, the enhancer element may 
be a myosin light chain enhancer. More preferably, the enhancer used in the expression 
cassette of the present invention is of vertebrate origin, more preferably of mammalian origin. 
The rat myosin light chain 1/3 enhancer (Donoghue et al., 1988, Gene & Dev., 2, 1779-1790) 
is especially useful. The enhancer element operably linked to the promoter may be localized 
either upstream or downstream of said promoter and may be used in either orientation. 
According to a preferred embodiment, the transcriptional control sequence comprises several 
enhancer sequences, the sequences of which are identical or selected independently of one 
another. Preferably, the transcriptional control sequence further comprises at least one 
sequence ensuring the polyadenylation of the transcribed RNA molecules. Such a sequence 
may be selected from the group consisting of the bGH (bovine growth hormone) 
polyadenylation signal (EP 173552), the SV40 polyadenylation signal and the globine 
polyadenylation signal, and is generally located at the 3'-end of the sequence encoding the 
protein, e.g. beta-interferon or RNA to be expressed. 

Moreover, according to another embodiment of the invention, an adenoviral vector 
according to the invention can, preferably in addition to the aforementioned genes, comprise a 
non-therapeutic gene coding for a protein which trans-activates non-adenoviral transcription. 
Naturally, the gene(s) of the El A region coding for a trans-activating protein, the expression 
of which would run the risk of rendering the adenovirus non-defective, will be avoided. 

In addition, the present invention relates to a method for producing a virus or virus- 
like particle comprising the steps of 

(a) transforming a suitable host cell with an adenoviral vector of the invention as 
described above; 

(b) culturing the transformed cell line under conditions suitable to allow formation 
of a virus or virus-like particle from said adenoviral vector; and 
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(c) recovering the virus or virus-like particle formed in step (b) from the culture. 

The adenoviral vector of the invention can be transformed into the cell in accordance 
with known techniques, such as microinjection into the cell nucleus (Capechi et al., 1980, Cell 
5 22, 479-488), transfection for example with CaP0 4 (Chen and Okayama, 1987, Mol. Cell 
Biol. 7, 2745-2752), electroporation (Chu et al., 1987, Nucleic Acid Res. 15, 1311-1326), 
lipofection/liposome fusion (Feigner et al., 19S7, Proc. Natl. Acad. Sci. USA 84, 7413-7417), 
particle bombardment (Yang et al., 1990, Proc. Natl. Acad. Sci. USA 87, 9568-9572), gene 
guns, infection (e.g. with a virus containing said adenoviral vector), direct DNA injection 

10 (Acsadi et al., 1991, Nature 352, 815-818), microprojectile bombardment (Williams et al., 
1991, Proc. Natl. Acad. Sci. USA 88, 2726-2730), or the like. 

With respect to cell line, both prokaryotic and eukaryotic cells may be employed, 
which include bacteria yeast, plants and animals, including human cells. Preferably, the 
adenoviral vector is replication-defective and said suitable host cell complements at least one 

15 defective function of said adenoviral vector, eventually in combination with a helper virus. 
The cell lines 293 (Graham et al., 1977, J. Gen. Virol. 36, 59-72) and PERC6 (Fallaux et al., 
1998, Human Gene Therapy 9, 1909-1917) are commonly used to complement the El 
function. Other cell lines have been engineered to complement doubly defective vectors (Yeh 
et al., 1996, J. Virol. 70, 559-565 ; Krougliak and Graham, 1995, Human Gene Ther. 6, 1575- 

20 15S6 ; Wang et al., 1995, Gene Ther. 2, 775-783 ; Lusky et al., 1998, J. Virol. 72, 2022-2033 ; 
EP919627 and WO97/041 19). 

According to one embodiment, the genome of the adenoviral vector lacks all or part of 
the sequence encoding a pIX (either wild type or modified), and the method of the invention 
employs preferably a host cell engineered to express a modified adenoviral pIX protein of the 

25 invention, and preferably a pIX protein which has been genetically modified to express a 
binding moiety within its C-terminal part . Such a cell line comprises either in a form 
integrated into the genome or in episome form a nucleic acid molecule encoding the modified 
adenoviral pIX protein of the invention. Of course, the nucleic acid molecule is placed under 
the control of appropriate translation^ and/or transcriptional regulatory elements to allow 

30 production of the modified pIX protein in said cell line. Preferably, this cell line is further 
capable of complementing one or more adenoviral functions selected from the group 
consisting of the functions encoded by the El, E2, E4, LI, L2, L3, L4, L5 regions or any 
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combination thereof. It is preferably produced from the 293 cell line or from the PER-C6 cell 
line complementing the El function by transfecting an expression vector encoding the 
sequence encoding the modified pIX protein. 

The virus or virus-like particle can be recovered from the culture supernatant but also 
5 from the cells after conventional lysis techniques (mechanical, enzymatic, freeze and thawing 
cycles) and optionally further purified according to standard techniques (e.g. 
ultracentrifugation, chromatography as described in W096/27677, WO98/00524 
WO98/26048 and WO00/50573). 

In a further embodiment, the invention relates to a method for producing a virus or 
10 virus-like particle comprising the steps of 

(a) transforming a suitable host cell with an adenoviral vector which encodes a 
wild-type pDC protein; 

(b) modifying the coding sequence for the pIX protein in the adenoviral vector so 
that the encoded pIX protein is one according to the present invention; 

15 (c) culturing the host cell under conditions suitable to allow formation of a virus or 

virus-like particle from the adenoviral vector of step (b); and 
(d) recovering the virus or virus-like particle formed in step (c) from the culture. 

Preferably, this method includes the production in cell culture of a virus or virus-like 
particle which has been genetically modified to express a binding moiety on its surface, 

20 whereby this binding moiety is comprised by the modified pIX protein of the invention. The 
virus or virus-like particle is grown in its host prior to modification, but once the modification 
that alters the binding specificity is made, the virus or virus-like particle is grown in the target 
cell. Thus, for example in the case where the binding moiety recognises a breast tumour cell 
antigen, the modified virus or virus-like particle is grown in breast tumour cell culture 

25 expressing that antigen. 
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In a further embodiment, the present invention relates to viruses or virus-like particles 
comprising the above-described adenovirus pIX protein, the nucleic acid molecule or the 
adenoviral vector of the invention or are obtainable by a method for producing a virus or 
virus-like particle as mentioned hereinabove. 

5 Advantageously, the virus or virus-like particle of the invention is furthermore 

substantially incapable of binding its host cell, i.e. incapable of binding the natural host cell of 
the wild-type virus or virus-like particle from which said virus or virus-like particle is derived. 
The term "derived" means in this context that the wild-type virus or virus-like particle 
structurally corresponds to the virus or virus-like particle of this embodiment of the invention, 

10 except for modifications rendering it suitable for gene delivery, for instance by deleting non- 
essential parts of the viral genome and/or inserting a heterologous sequence to be expressed, 
and for the modification of the pIX protein and its coding sequence as described herein. By "a 
virus or virus-like particle substantially incapable of binding its host cell", a modified virus is 
meant that has no more than 20%, preferably no more than 10%, more preferably no more 

15 than 5%, and even more preferably no more than 1% of the binding affinity of the unmodified 
virus for the host cell. This lack of binding can be obtained for example by modifying the 
fiber protein as discussed above and disclosed in the prior art (see for example US 5,756,086, 
US 5,885,808, EP 991 763, PCT FR00/01559 or PCT FROO/03263). 

In accordance with a preferred embodiment of the present invention, the virus or 
20 virus-like particle comprises a modified binding specificity conferred by a pIX moiety 
modified in the sense that it comprises a binding moiety in order to allow the virus or virus- 
like particle to bind to a target cell. Preferably, said virus or virus-like particle is furthermore 
substantially incapable of binding its host cell. 

In order to differentiate the cells for which the virus or virus-like particle of the 
25 invention shows specificity in respect to the capacity to infect, a distinction is made herein 
between "host cell" and "target cell". However, further below, for instance in those parts of 
the description where the function of an exogenous nucleotide sequence in the target cell is 
described, this distinction may not apply. Accordingly, in contrast to "target cell", the term 
"host cell" refers to the cells to which an unmodified, naive virus can bind by using its 
30 receptor-like molecule and the cognate receptor-like molecule on the surface of the cell. The 
term "target cell" refers to cells to which the modified virus can bind by using its binding 
moiety. In some circumstances, in the context of this aspect of the invention, such as when the 
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binding moiety recognises an entity on the host cell which is not the cognate receptor-like 
molecule, then the host cell may be the target cell. 

In a preferred embodiment of the invention, the target cell is a eukaryotic, especially 
mammalian cell, and it is expected that the invention will find uses in the areas of gene 
5 therapy and cancer treatment. 

It is also preferred that the virus or virus-like particle is "replication-defective". By 
"replication defective", a virus is meant whose genetic material has been manipulated so that 
it cannot divide or proliferate in the cell it infects. 

The binding moiety of the virus or virus-like particle of the invention provides the 
10 target cell binding specificity. 

In one preferred embodiment, the virus or virus-like particle is an adenovirus. 
Preferentially, in such a virus or virus-like particle, the E1B gene is substantially deleted or 
modified so that its gene product no longer interacts with the E1A protein. E1A protein 
stimulates apoptosis but normally its action is inhibited by E1B. Conveniently, the E1B gene 
15 is inactivated by insertion; preferably a cytotoxic gene, as defined below, is inserted at or near 
the E1B gene. 

El, E3 and a site upstream of E4 may be used as sites for insertion of foreign DNA 
sequences in the generation of recombinant adenoviruses (for example see Berkner and Sharp 
(1984) Nucl. Acids Res. 12, 1925-1941; Chanda et al (1990) Virology 175, 535-547; Haj- 

20 Ahmad and Graham (1986) J. Virol. 57, 267-274; Saito et al (1985) J. Virol. 54, 711-719; all 
incorporated herein by reference). Since the upper size limit for DNA molecules that can be 
packaged into adenovirus particles is approximately 105% of the wild-type genome, only 
about 2 kb of extra DNA can be inserted without compensating deletions of viral DNA. 
Although El is essential for virus replication in cell culture, foreign DNA can be substituted 

25 for El sequences when the virus is grown in 293 or PERC6 cells which are transformed by 
Ad5 DNA and constitutively express El (Graham et al (1977) J. Gen. Virol. 36, 59-72 and US 
6,033,908, incorporated herein by reference). Several vectors having 1.9 kb deleted from E3 
of Ad5 have been constructed without interfering with virus replication in cell culture 
(reviewed by Graham and Prevec (1992) in "Vaccines: New Approaches to Immunological 

30 Problems" R. W. Ellis (Ed.), Butterworth-Heinemann, Boston, Mass., pages 364-390, 
incorporated herein by reference). Such vectors allow for insertion of up to 4 kb of foreign 
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DNA. Recombinant adenoviruses containing inserts in E3 replicate in all Ad-permissive cell 
lines and a number of adenovirus vectors containing E3 inserts have been shown to express 
foreign genes efficiently both in vitro and in vivo. As mentioned above, the adenovirus or 
adenovirus-like particle can also be modified (e.g. in the fiber, penton base and/or hexon), so 
5 as to substantially reduce or abolish the binding to the cellular receptors which normally 
mediate attachement and entry of a wild-type adenovirus and adenovirus-like particles to its 
host cell. 

Substantially replication-defective adenoviruses may be made by creating a deficiency 
of the E1A protein. Suitably this is achieved by deleting the E1A gene or by making 
10 mutations within the El A gene that prevent expression of the El A protein. Examples of 
suitable mutations are deletions within the E1A coding region; nonsense mutations; and 
frameshift mutations. 

The viruses or virus-like particles of the invention can be propagated in a 
complementation cell line providing in trans the missing function which is essential to viral 

15 replication (encoded by El, E2 and/or E4 regions). Widely used complementation line are the 
embryonic kidney line 293 (Graham et al.,1977, J. Gen. Virol., 36, 59-72), 911 cells 
(deposited under no 95062101 at the ECACC - Fallaux et al. (1996) Hum. Gene Ther. 7, 215- 
222) or the embryonic retinal line PERC6 (ECACC N°. 96022940) cells. PER.C6 cells is the 
abbreviation of PGK-E1 Retinoblasts. C6 is the clone number. PGK - El reflects that the 

20 adenovirus type 5 E1A region in the construct is driven by the human PGK promoter. PER 
cells are Human Embryo Retinoblasts transformed with El sequences (nt 459-3510) of human 
adenovirus type 5 (WO 97/00326). 

Cells useful for propagating the virus or virus-like particle of the invention may 
additionally be derivatives of existing cell lines, e.g., from 293 or PER.C6 cell lines. Such 
25 derivative cells should express the genes necessary to complement in trans deletions in an 
adenoviral genome or should support replication of an otherwise defective adenoviral vector. 
Such cells may for example express the El, E2, E4, and/or late functions (see for example EP 
919 627, US 6,040,174 , US 6,133,028, US 6,033,908 or US 5,994,128). 

For the purposes of the present invention, the term "deletion" or "lacking" refers to the 
30 elimination of at least one nucleotide in the target region, and the deletion can naturally be 
continuous or discontinuous. Generally speaking, such a deletion results in a measurable 
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change : for example, the deletion results in impairing or improving the function of the 
product encoded by the genetic material bearing the deletion compared with the 
corresponding genetic material not bearing the deletion. For example, a deletion in the El 
region results in lack of adenoviral replication. These terms are widely used by those skilled 
5 in the art. "All or part" of the regions of the adenoviral genome may be deleted which means 
that either the whole or only a portion of the region in question is removed. Deletions are 
preferred which prevent the production of at least one expression product encoded by the said 
region. Hence they may he in a coding region or a regulatory region such as the promoter 
region, and may affect at least one nucleotide so as to destroy the reading frame of a gene or 
10 render a promoter region non-functional. The deletions in question may also comprise partial 
deletions of one or more genes of the said region or of the whole of the region. 

The term "virus or virus-like particle" as used herein is synonymous with "adenoviral 
capsid" which, in turn, is a general term designating both "adenoviral particle" and 
"adenoviral pseudo particle". "Adenoviral particle" concerns an "adenoviral genome" 

15 (recombinant or wild type) associated with viral polypeptides forming what is usually called 
an adenovirus, or forming a complex where the nucleic acid, while being associated with the 
viral polypeptides, is not included into a viral element such as a viral capsid (see US 
5,928,944 and WO 9521259). On the opposite, "non-naked" means that said nucleic acid 
may be associated (i) with viral polypeptides forming what is usually called a virus 

20 (adenovirus, retrovirus, poxvirus, etc..) or forming a complex where the nucleic acid while 
being associated with viral polypeptides is not included into a viral element such as a viral 
capsid (see US 5,928,944 and WO 9521259). An adenoviral particle according to the 
invention may be prepared by passage in any complementation line providing in trans the 
functions for which an adenoviral vector according to the invention is defective, for example 

25 line 293 of the prior art. These preparation techniques are known to a person skilled in the art 
(Graham and Prevec, 1991, Methods in Molecular Biology, vol. 7, 109-128, Ed: E. J. Murey, 
The Human Press Inc.). 

Furthermore, the virus or virus-like particle of the invention can be of use for 
transferring non- viral macromolecules into a target cell. There are two means by which such 
30 transfer can be effected. First, the adenoviral vector be employed to transfer non-viral 
macromolecules packaged within the adenoviral vector either in place of, or in addition to, 
normal adenoviral components (Berkner, K. L., BioTechniques, 6, 606-629 (1988)). For 
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example, the genome of the adenovirus can be modified to incorporate an exogenous 
nucleotide sequence. The recombinant adenovirus is then packaged to constitute an infectious 
virus capable of entering cells and transferring the exogenous nucleotide sequence to the 
nucleus (Rosenfeld et al., Science, 252, 431-434 (1991); Rosenfeld et ah, Cell, 68, 143-155 
5 (1992); Quantin et al., Proc. Natl. Acad. Sci., 89, 2581-2584 (1992); Berkner, K. L., 
BioTechniques, 6, 606-629 (1988)). 

Second, the virus or virus-like particle can be employed to mediate the transfer of non- 
viral macromolecules either linked to the surface of the adenoviral vector or, in a "bystander" 
process where the macromolecule is cointernalized, taken along as cargo in the adenoviral 
10 receptor-endosome complex (Otero et al., Virology, 160, 75-80 (1987); FitzGerald et al., Cell, 
32, 607-617 (1983); Seth et al., Mol. Cell Biol., 4, 1528-1533 (1984); Yoshimura, Cell Struct. 
Funct., 10, 391-404 (1985); Defer et al., J. Virol., 64, 3661-3673 (1990)). 

The mechanism by which an adenovirus may augment internalization of non-viral 
biologic material is believed to be by increasing the permeability of the target cell plasma 

15 membrane (Otero et al., Virology, 160, 75-80 (1987)) or, more likely, by cointernalization of 
the exogenous biologic material as an "innocent bystander" when the adenovirus-receptor 
complexes cluster on the membrane and are internalized (FitzGerald et al., Cell, 32, 607-617 
(1983); Seth et al., Mol. Cell Biol., 4, 1528-1533 (1984); Yoshimura, Cell Struct. Funct., 10, 
391-404 (1985); Otero et al., Virology, 160, 75-80 (1987); Defer et al., J. Virol., 64, 3661- 

20 3673 (1990)). These processes are not adenoyirus-specific, as similar phenomena have been 
observed with other non-enveloped viruses such as picornavirus (Fernandez-Puentes et al., 
Cell, 20, 769-775 (1980); Otero et al., Virology, 160, 75-80 (1987); Carrasco, Virology, 113, 
623-629 (1981)), as well as enveloped viruses including paramyxovirus, rhabdovirus, 
poxvirus, and togavirus (Fernandez-Puentes et al., Cell, 20, 769-775 (1980); Otero et al., 

25 Virology, 160, 75-80 (1987); Yamaizumi et al., Virology, 95, 216-221 (1979); Carrasco et al., 
Virology, 117, 62-69 (1982)). 

Most of the research attention on virus-mediated cointernalization of macromolecules 
has been focused on cointernalization of proteins, including toxins and various reporter 
proteins (Ferna-Puentes et al., Cell, 20, 769-775 (1980); FitzGerald et al., Cell, 32, 607-617 
30 (1983); Seth et al., Mol. Cell Biol., 4, 1528-1533 (1984); Otero et al., Virology, 160, 75-80 
(1987); Defer et al., J. Virol., 64, 3661-3673 (1990); Carrasco, Virology, 113, 623-629 
(1981); Yamaizumi et al., Virology, 95, 216-221 (1979); Carrasco et al., Virology, 117, 62-69 
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(1982)). The concept that cointernalization might be employed for adeno virus-mediated 
transfer of nucleic acids was suggested, but not evaluated, by Otero and Carrasco (Otero et al., 
Virology, 160, 75-80 (1987)). In fact, the more recent approaches with respect to the transfer 
of nucleic acids using an adenovirus have centered on nucleic acid transfer by attachment of 
the nucleic acid to molecules capable of effecting its entry into the cell. For instance, in one 
approach, the nucleic acid may be part of a polylysine-glycoprotein carrier complex capable 
of binding a particular cell surface receptor, or is complexed with a nonspecific ligand such as 
a charged polypeptide (Rosenfeld et al., Science, 252, 431-434 (1991); Curiel et al., Proc. 
Natl. Acad. Sci., 88, 8850-8854 (1991); Rosenfeld et al., Cell, 68, 143-155 (1992); Quantin et 
al., Proc. Natl. Acad. Sci., 89, 2581-2584 (1992); Curiel et al., Hum. Gene Therapy, 3, 147- 
154 (1992)); Cotten et al., Proc. Natl. Acad. Sci., 89, 6094-098 (1992); Cotten et al., J. 
Virology, 67, 3777-3785 (1993)). In a more recent approach, the nucleic acid may be attached 
to the outside of an adenoviral capsid by means of conjugation of the nucleic acid through a 
polylysine residue to an antibody having affinity to an adenoviral capsid protein (Curiel et al., 
Human Gene Ther., 3, 147-154 (1992)). Thus, despite this early suggestion by Otero et al., 
researchers have clearly perceived a lack of feasibility of using adenovirus-driven RME for 
the transfer of nucleic acids. 

Moreover, the uses of the virus or virus-like particle for transferring nucleic acids are 
limited by the specific receptor to the ligand employed, i.e. the specific receptor must be 
present on the cell surface for transfection to be accomplished. Additionally, it was discovered 
recently that better transfection results are obtained when the DNA is not physically attached 
to any molecule upon introduction into the cell (Wolff et al., Science, 247, 1465 (1990); 
Acsadi et al., Nature, 352, 815 (1991)). This finding underscores the restrictive nature of 
current approaches of adenovirus-mediated transfer of DNA to the cell, which require the 
attachment of DNA to one or more other moieties for cell transfection. 

The invention also relates to a eukaryotic host cell comprising an adenovirus pIX 
protein, nucleic acid molecule or adenoviral vector according to the invention or being 
infected with a virus or virus-like particle of the invention. Said cell is advantageously a 
mammalian cell, and preferably a human cell, and can comprise said vector in integrated form 
in the genome, or preferably in non-integrated (episome) form. 

For the purpose of the invention, the term " eukaryotic host cell" should be understood 
broadly without any limitation concerning particular organization in tissue, organ, etc or 
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isolated cells of a mammalian (preferably a human). Such cells may be unique type of cells or 
a group of different types of cells and encompass cultured cell lines, primary cells and 
proliferative cells from mammalian origin, with a special preference for human origin. 
Suitable host cells include but are not limited to hematopoietic cells (totipotent, stem cells, 
5 leukocytes, lymphocytes, monocytes, macrophages, APC, dendritic cells , non-human cells 
and the like), pulmonary cells , tracheal cells, hepatic cells, epithelial cells, endothelial cells, 
muscle cells (e.g. skeletal muscle, cardiac muscle or smooth muscle), fibroblasts. 

The present invention also relates to a complementation cell line suitable for 
producing the virus or virus-like particle of the invention or for applying the method for 
10 producing such a virus or virus -like particle as described above. 

Such a complementation line may contain a complementation element comprising, in 
particular, a portion of the El region of the genome of an adenovirus with the exception of the 
5 1 ITR; said complementation element being capable of complementing in trans a defective 
15 adenoviral vector and being integrated in the genome of said complementation line or inserted 
into an expression vector. In particular, the eukaryotic host cell of the invention is a cell of a 
complementation line, in particular when the adenoviral vector contained in the host cell is 
defective. 

In the context of the present invention, the term "complementation line" refers to a 
20 eukaryotic cell capable of providing in trans the function(s) for which an adenoviral vector is 
defective. In other words, it is capable of producing the protein or proteins needed for the 
replication and encapsidation of said adenoviral vector, early and/or late proteins which it 
cannot itself produce and which are needed for building a viral particle. Naturally, said 
portion may be modified by mutation, deletion and/or addition of nucleotides, as long as these 
25 modifications do not impair its capacity for complementation. Thus, for example an 
adenoviral vector which is defective for the El function will have to be propagated in a 
complementation line for El (i.e. capable of providing in trans the protein or set of proteins 
encoded by the El region which the vector cannot produce), a vector which is defective for 
the El and E4 functions will be propagated in a complementation line for El and E4 
30 (providing the necessary proteins encoded by the El and E4 regions), and lastly a vector 
which is defective for the El, E2 and E4 functions will be propagated in a complementation 
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line for the three functions. As mentioned in the introduction, the E3 region is non-essential, 
and does not need to be specifically complemented. 

A complementation line according to the invention may be derived either from an 
immortalized cell line capable of dividing indefinitely, or from a primary line. In accordance 
5 with the objectives pursued by the present invention, a complementation line according to the 
invention is useful for the encapsidation of any defective adenoviral vector, and especially a 
defective adenoviral vector according to the invention. 

The present invention further relates to a pharmaceutical composition comprising as 
therapeutic and/or prophylactic agent an adenoviral vector, a virus or virus-like particle, a 
10 eukaryotic host cell or a complementation cell line according to the invention, in combination 
with a vehicle or carrier which is acceptable from a pharmaceutical standpoint. It is preferred 
that said agent is capable of expressing a therapeutically useful gene such as those enumerated 
above. 

The pharmaceutical composition of the invention can be administered by any suitable 

15 route. Administration into vertebrate target tissues, and more specifically into the muscle, can 
be performed by different delivery routes (systemic delivery and targeted delivery). 
According to the present invention, the pharmaceutical composition is preferably 
administered into skeletal muscle, however administration can also occur in other tissues of 
the vertebrate body for instance including those of non skeletal muscle, skin, brain, lung, 

20 liver, spleen, bone marrow, thymus, heart, lymph, bone, cartilage, pancreas, kidney, gall 
bladder, stomach, intestine, testis, ovary, uterus, rectum, nervous system, eye, gland, 
connective tissue, blood, tumor. Similarly, in order to direct expression of the therapeutically 
useful gene to the intended site of action, the nucleic acid can be associated with targeting 
molecules which are capable to direct its uptake into targeted cells. Gene therapy literature 

25 provides many mechanisms for efficient and targeted delivery and expression of genetic 
information within the cells of a living organism. Administration of the pharmaceutical 
composition may be made by intradermal, subdermal, intravenous, intramuscular, intranasal, 
intracerebral, intratracheal, intraarterial, intraperitoneal, intravesical, intrapleural, 
intracoronary or intratumoral injection, with a syringe or other devices. Transdermal 

30 administration is also contemplated, as are inhalation or aerosol routes. Injection, and 
specifically intratumoral, intravenous or intramuscular injection, is preferred. 
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The virus or virus-like particles of the invention may be administered in any suitable 
way, usually parenterally, for example intravenously, intrapeiitoneally or intravesically, in 
standard sterile, non-pyrogenic formulations of diluents and carriers, for example isotonic 
saline (when administered intravenously). 

5 The pharmaceutical composition can be designed or used for repeated administrations 

to the patient without major risk of the administered pharmaceutical composition to induce a 
significant immune reaction. Administration may be by single or repeated dose, once or 
several times after a certain period of time. Repeated administration allows a reduction in the 
dose of the pharmaceutical composition administered at a single time. The route of 
10 administration and the appropriate dose vary in function of several parameters, for example 
the individual patient, the side effects of the disease, or the albumin level before treatment. 

The administered volume preferably varies from about 10 :1 to 500 ml, most preferably 
from about 100:1 to 100 ml. The administered volume can be adapted depending on the 
administration route, the treated patient and the patient's weight. 

15 

The pharmaceutical composition according to the invention is intended especially for 
the preventive or curative treatment of disorders such as: genetic disorders such as 
hemophilia, cystic fibrosis or Duchene's and Becker type myopathies, cancers such as those 
induced by oncogenes or viruses, retroviral diseases such as AIDS, (acquired 

20 immunodeficiency syndrome resulting from HTV infection), and recurrent viral diseases such 
as herpes virus-induced infections. The composition of the present invention is particularly 
intended for the preventive or curative treatment of disorders, conditions or diseases 
associated with cancer. The term "cancer" encompasses any cancerous conditions including 
diffuse or localized tumors, metastasis, cancerous polyps and preneoplastic lesions (e.g. 

25 dysplasies) as well as diseases which result from unwanted cell proliferation. A variety of 
tumors may be selected for treatment in accordance with the composition of the invention. In 
general, solid tumors are preferred, although leukemias and lymphomas may also be treated 
especially if they have developed a solid mass, or if suitable tumor associated markers exist 
such that the tumor cells can be physically separated from nonpathogenic normal cells. For 

30 example, acute lymphocytic leukemia cells may be sorted from other lymphocytes with the 
leukemia specific marker "CALLA". Cancers which are contemplated in the context of the 
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invention include without limitation glioblastoma, sarcoma, melanomas* mastocytoma, 
carcinomas (e.g. colorectal and renal cell carcinomas) as well as breast, prostate, testicular, 
ovarian, cervix (in particular, those induced by a papilloma virus), lung (e.g. lung carcinomas 
including large cell, small cell, squamous and adeno-carcinomas), kidney, bladder, liver, 
colon, rectum, pancreas, stomach, esophagus, larynx, brain, throat, skin, central nervous 
system, blood (lymphomas, leukemia, etc.), bone, etc cancers. 

A pharmaceutical composition according to the invention may be manufactured in a 
conventional manner. In particular, a therapeutically effective amount of a therapeutic or 
prophylactic agent is combined with a vehicle such as a diluent. A composition according to 
the invention may be administered by aerosol or via any conventional route in use in the field 
of the art, especially via the oral, subcutaneous, intramuscular, intravenous, intraperitoneal, 
intrapulmonary or intratracheal route. The administration may take place in a single dose or a 
dose repeated one or more times after a certain time interval. The appropriate administration 
route and dosage vary in accordance with various parameters, for example with the individual 
being treated or the disorder to be treated, or alternatively with the gene(s) of interest to be 
transferred. Generally speaking, a pharmaceutical composition according to the invention 
comprises a dose of adenovirus according to the invention of between 10 4 and 10 14 , 
advantageously 10 5 and 10 13 and preferably 10 6 and 10 11 . A pharmaceutical composition, 
especially one used for prophylactic purposes, can comprise, in addition, an adjuvant which is 
acceptable from a pharmaceutical standpoint. 

The invention also encompasses a method of treatment, according to which a 
therapeutically effective amount of an adenoviral vector, a virus or virus-like particle, a 
eukaryotic host cell or a complementation cell line according to the invention is administered 
to a patient requiring such treatment. Preferably, said treatment is by gene therapy which may 
be in vivo or ex vivo gene therapy. Accordingly, the invention relates to the use of an 
adenoviral vector, a virus or virus-like particle, a eukaryotic host cell or a complementation 
cell line according to the invention for the preparation of a pharmaceutical composition for 
prophylaxis, treatment and/or vaccination of a patient in need of such prophylaxis, treatment 
and/or vaccination. Preferably, said adenoviral vector, virus, or virus-like particle, host cell or 
complementation cell line is capable of expressing a therapeutically useful gene. 

In a preferred embodiment, the method according to the present invention involves 
administration into a fluid vessel, such as for example a blood vessel or a lymph vessel, and 
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can be advantageously improved by combining injection in an afferent and/or efferent fluid 
vessel with increases of permeability of said vessel. In a particularly preferred embodiment, 
said increases is obtained by increasing hydrostatic pressure (i.e. by obstructing outflow 
and/or inflow), osmotic pressure (with hypertonic solution) and/or by introducing a 
5 biologically active molecule (e.g. histamine into administered composition) (see, e.g., WO 
98/58542). 

In addition to the prophylactic and/or therapeutical agents, pharmaceutical 
compositions of the invention may contain a pharmaceutically acceptable carrier. 

"Pharmaceutically acceptable carrier" allows use of the pharmaceutical composition in 

10 a method for the therapeutic treatment of humans or animals. In this particular case, the 
carrier can be a pharmaceutically suitable injectable carrier or diluent (for examples, see 
Remington's Pharmaceutical Sciences, 16 th ed. 1980, Mack Publishing Co). Such carrier or 
diluent is pharmaceutically acceptable, i.e. is non toxic to a recipient at the dosage and 
concentration employed. It is preferably isotonic, hypotonic or weakly hypertonic and has a 

15 relatively low ionic strength, such as provided by a sucrose solution. Furthermore, it may 
contain any relevant solvents, aqueous or partly aqueous liquid carriers comprising sterile, 
pyrogen-free water, dispersion media, coatings, and equivalents, or diluents (e.g. Tris-HCl, 
acetate, phosphate), emulsifiers, solubilizers or adjuvants. The pH of the pharmaceutical 
preparation is suitably adjusted and buffered in order to be useful in in vivo applications. It 

20 may be prepared either as a liquid solution or as a solid form (e.g.lyophilized) which can be 
suspended in a solution prior to administration. Representative examples of carriers or 
diluents for injectable formulation include water, isotonic saline solutions which are 
preferably buffered at a physiological pH (such as phosphate buffered saline or Tris buffered 
saline), mannitol, dextrose, glycerol and ethanol, as well as polypeptides or protein such as 

25 human serum albumin. For example, such formulations comprise the pharmaceutical 
composition prepared according to the use of the present invention in 10 mg/ml mannitol, 1 
mg/ml HSA, 20 mM Tris pH 7.2 and 150 mM NaCl. Another preferred formulation 
comprises 1M sucrose, 150 mM NaCl , ImM MgCI 2 , 54 mg/1 Tween 80, 10 mM Tris pH 8.5. 

In accordance with these therapeutical aspects of the invention, a preferred 
30 embodiment relates to a method of delivery of the virus or virus-like particle of the invention, 
preferably, which contains a gene encoding a molecule having an indirectly cytotoxic 
function. 
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Suitably, the indirectly cytotoxic function refers to an enzyme that converts a prodrug 
to a toxic drug. With such a virus or virus-like particle, once the virus or virus-like particle 
has bound to the target cells, delivered its nucleic acid to the cells, and expressed the 
indirectly cytotoxic functions, which typically takes a day or so, the pro-drug is administered. 
5 The timing between administration of the virus or virus-like particle and the pro-drug may be 
optimised in a non-inventive way. 

The dosage of the pro-drug will be chosen by the physician according to the usual 
criteria. The dosage of the virus or virus-like particle will similarly be chosen according to 
normal criteria, and in the case of tumour treatment, particularly with reference to the type, 
10 stage and location of tumour and the weight of the patient. 

The duration of treatment will depend in part upon the rapidity and extent of any 
immune reaction to the virus or virus-like particle. 

Some of the viruses or virus-like particles either in themselves, or together with an 
appropriate pro-drug, are in principle suitable for the destruction of cells in any tumour or 
15 other defined class of cells selectively exhibiting a recognisable (surface) entity. Examples of 
types of cancer that may be treated using the viruses or virus-like particles are those cited 
above, and especially cancers of the breast, prostate, colon, rectum, ovary, testicle and brain. 
The compounds are principally intended for human use but could be used for treating other 
mammals including dogs, cats, cattle, horses, pigs and sheep. 

20 In a further preferred embodiment, the invention pertains to a pharmaceutical 

composition which comprises, in addition to the compounds mentioned above such as an 
adenoviral vector expressing a therapeutically useful gene as for example beta-interferon, at 
least one adjuvan* capable of improving the transfection capacity or gene expression in the 
cell. Such an adjuvant can be selected from the group consisting of chloroquine, protic 

25 compounds such as propylene glycol, polyethylene glycol, glycerol, ethanol, 1 -methyl L-2- 
pyrrolidone or derivatives thereof, aprotic compounds such as dimethylsulfoxide (DMSO), 
diethylsulfoxide, di-n-propylsulfoxide, dimethylsulfone* sulfolane, dimethyl-formamide, 
dimethylacetamide, tetramethylurea, acetonitrile or derivatives. The composition may also 
advantageously comprise a source of a cytokine which is incorporated in the form of a 

30 polypeptide or as a polynucleotide encoding the cytokine. Preferably, said cytokine is 
interleukin 10 (IL-10)(EP-A-967 289). The therapeutic composition can further comprise a 
nuclease inhibitor such as actine G, or specific magnesium or lithium concentrations. 
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Moreover according to a specific embodiment, the pharmaceutical composition 
comprises transformed a host cell which preferably may be a human muscular cell which is 
further encapsulated. Cell encapsulation methodology has been previously described which 
5 allows transplantation of encapsulated cells in treatment of Parkinson's disease (Tresco et al., 
1992, ASAIO J., 38, 17-23) or amyotrophic lateral sclerosis (Aebischer et al., 1996, Hum. 
Gene Ther. , 7, 851-860). According to said specific embodiment, transformed cells are 
encapsulated by compounds which form a microporous membrane* and said encapsulated 
cells can further be implanted in vivo. Capsules, for example approximately 1 cm in length 

10 containing the cells of interest may be prepared employing a hollow microporous membrane 
fabricated from poly-ether-sulfone (PES) (Akzo Nobel Faser AG, Wuppertal, Germany ; 
Deglon et al, 1996, Hum. Gene Ther., 7, 2135-2146). This membrane has a molecular weight 
cutoff greater than l,000,OOODa, which permits the free passage of proteins and nutrients 
between the capsule interior and exterior, while preventing the contact of transplanted cells 

15 with host cells. The entrapped cells may be implanted by intradermal, subdermal, intravenous, 
intramuscular, intranasal, intracerebral, intratracheal, intraarterial, intraperitoneal, intravesical, 
intrapleural, intracoronary or intratumoral ways. In case where said transformed host cell is a 
myoblast, it can migrate from the site of injection to muscles where expression of the gene of 
interest (e.g. beta-interferon) can occur. 

20 While the present invention has been described with reference to preferred 

embodiments and specific examples, one of the ordinary skill after reading the foregoing 
specification will be able to effect various changes, substitutions of equivalents, and other 
alterations to the processes and produced cells set forth herein. It is therefore intended that the 
protection claimed hereon be limited only by the definition contained in the appended claims 

25 and equivalents thereof. 

These and other embodiments are disclosed or are obvious from and encompassed by 
the description and examples of the present invention. Further literature concerning any one 
of the methods, uses and compounds to be employed in accordance with the present invention 
may be retrieved from public libraries, using for example electronic devices. For example the 
30 public database "Medline" may be utilized which is available on Internet, e.g. under 
http://www.ncbi.nlm.nih.gov/PubMed/medline.html. Further databases and addresses, such as 
http://www.ncbi.nlm.nih.gov, http://www.infobiogen.fr, 
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http://www.j^.ch/biology/re^ http://www.tigr.org, are known to the person 

skilled in the art and can also be obtained using, e.g., http://www.Iycos.com. An overview of 
patent information in biotechnology and a survey of relevant sources of patent information 
useful for retrospective searching and for current awareness is given in Berks, TIBTECH 12 
5 (1994), 352-364. 

The invention has been described in an illustrative manner, and it is to be understood 
that the terminology which has been used is intended to be in the nature of words of 
description rather than of limitation. Obviously, many modifications and variations of the 
present invention are possible in light of the above teachings. It is therefore to be understood 
10 that within the scope of the appended claims, the invention may be practiced different from 
what is specifically described herein. 

All of the above cited disclosures of patents, publications and database entries are 
specifically incorporated herein by reference in their entirety to the same extent as if each 
such individual patent, publication or entry were specifically and individually indicated to be 
15 incorporated by reference. 

BRIEF DESCRIPTION OF THE DRAWINGS 
FIGURE LEGENDS 
Figure 1 

20 Conserved sequence elements in pDC Aminoacid sequence alignments (Clustal X) of 

pIX from several human (top 8 sequences) and animal (bovine -b-, porcine -p- and canine -c-) 
Ad serotypes, as indicated on the left, were performed. Accession numbers are: Ad2 
(p03282), Ad5 (p032S5), Ad3 (JO 1962), Ad7 (03283); Ad9 (q9yl97); Adl2 (03284); Ad40 
(p48312); Ad41 (p32539); Ad2b (q65377); Ad3p ( q 9w9x3); Adlc (q65944); Ad2c (pl4268). 

25 Dots correspond to gaps inserted by the program to optimise alignments. Conserved sequence 
elements are boxed. and " A " (bottom) denote identical and related amino acid residues, 
respectively. Numbers in parentheses (top) refer to coordinates of amino acids in human Ad2 
and Ad5 pIX, relative to the starting methionine. 

30 
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Figure 2 

Schematic representation of specific subdomains of pIX. (A) The conserved pEX 
sequence domains, including the central human-specific poly-alanine stretch are represented 
as boxes, with relevant peptidic elements and coordinates from the Ad2 or Ad5> wt sequence. 
5 Point mutations or small deletions are indicated above, while larger deletions as mentioned in 
the text are depicted below. (B) The predicted helical-wheel representation of the C-terminal 
leucine-repeat of human Ad2 or Ad5 pIX, from residues 100 [first L at position (a)] to 134 [S 
at position (g)] is shown next to a symetrically positioned wheel of the same region. The 
potential hydrophobic interactions [(a)/(d') and (d)/(a f )] between the 2 helices is suggested by 
10 the alignments of the residues at positions (a) and (d) with (d') and (a 1 ), respectively, in a 
putative coiled-coil structure. The charged residues ("+" or "-") which presumably stabilise 
these interactions are indicated. Residues (with corresponding coordinates) that are altered by 
site-directed mutagenesis are bold-faced. 

Figure 3 

15 The integrity of the conserved N-terminal domain of Ad2 pIX is required for capsidic 

incorporation. CsCl-purified Ad5 El° virions expressing wt pIX, no pIX (El° IX°) or specific 
pIX variants (as indicated on the top; "L-V" means LI 14P+V1 17D) were disrupted by boiling 
in SDS sample buffer, fractionated by 10%-SDS PAGE and analysed by immunoblotting 
using polyclonal anti-pIX antibody (even lanes named V). Extracts were prepared (44) from 

20 293 cells that had been infected by the same viruses (MOI of 20 PFU per cell) and collected 
36 h pi. Aliquots were analysed by immunoblotting (odd lanes named "e"), next to the 
corresponding virions. The expected position of pIX is indicated. 

Figure 4 

The leucine-repeat and central hinge region are crucial for Ad2 pIX transcriptional 
25 activity. (A) Structure of the chimeric pEla-CAT reporter plasmid in which the promoter 
region of the Ad5 El a transcription unit was fused to the CAT gene. (B) A549 cells were 
transfected with 1 |ig of the pEla-CAT reporter plasmid, either alone (column 1) or together 
with plasmids expressing the wild type or mutated pIX as F-tagged fusion proteins: F/IX 
derivatives (0.1 fig; columns 2 and 3-14, respectively) or IX/F derivatives (0.5 j-ig; columns 
30 15 and 16, respectively). Cells were collected 36 h later, and extracts were prepared. Relative 
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CAT activities (means from 3 independent experiments) are represented with corresponding 
standard deviations. 

Figure 5 

Self-interaction of pDC A549 cells were transfected with vectors expressing the wt 
pIX, together with vectors expressing N-terminally F-tagged wt or mutated pIX proteins, as 
indicated. Cell extracts were prepared and aliquots were precipitated (IP) with monoclonal 
antibodies directed against the F epitope. The immunoprecipitates were submitted to Western- 
blot analysis (WB), probing first with the anti-F monoclonal antibody. After exposure, the 
same blot was washed and reprobed with polyclonal antibodies against pIX. The position of 
bands corresponding to the wt and F-tagged pIX are indicated. The bands marked by the 
asterisk likely correspond to proteolytic breakdown products of the F-tagged derivatives. The 
position of the immunoglobulin heavy subunit [IgG(H)] is indicated to show that equal 
amounts of antibody were used in the IP reaction. 

Figure 6 : Sequence of the Adenoviral genome 5 as disclosed in the GenBank data 
bank under the reference M73260. 

Figure 7 illustrates that pIX protein actively induces specific nuclear inclusions. (A) 
Ad-2 infected A549 cells at intermediate stage of nuclear transformation (14-16h pi) were 
processed for immunogold labelling with anti-pIX polyclonal antibody on Lowicryl sections 
of formaldehyde-fixed cells ; fibrillo-granular network (fg), c.a. inclusions (star), viral single 
strand DNA (a), cytoplasm (c). Bar 0.5(im. (B) Late stage of nuclear transformation (24-30 h 
pi) ; c.a. inclusions (star), electron-translucent region (e), perinuclear layer of host chromatin 
(ch), viral region (vr), virus (v), cytoplasm (c). Bar 0.5|im. (C) Overexpression of 
recombinant pIX protein induces tha accumulation of the protein within newly formed c.a. 
inclusions; c.a. inclusions (star), perinuclear layer of condensed chromatin (ch), nucleolus 
(nu), Bar 0.5|j.m. 
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The examples which follow illustrate only one embodiment of the present invention. 

The constructions described below are carried out according to the general techniques 
of genetic engineering and molecular cloning as detailed in Maniatis et al., (1989, Laboratory 
Manual, Cold Spring Harbor, Laboratory Press, Cold Spring Harbor, N.Y.). 

5 The collective steps of cloning employing bacterial plasmids is carried out by passage 

in Escherichia coli (E. coli) strain 5K or BJ, whereas those employing vectors derived from 
phage M13 are carried out by passage in E. coli NM522. As regards the steps of PCR 
amplification, the protocol as described in PCR Protocols— A guide to methods and 
applications (1990, edited by Innis, Gelfand, Sninsky and White, Academic Press Inc.) is 

10 applied. 

The fragments inserted into the different constructions described below are indicated 
precisely according to their position in the nucleotide sequence of: 

the Ad5 genome, as disclosed in the GenBank data bank under the reference M73260 , 

the adenovirus type-2 (Ad2) genome, as disclosed in the GenBank data bank under the 
15 reference JO 1949, the SV40 virus genome, as disclosed in the GenBank data bank under the 
reference J02400. 



The experiments described in the Examples can be summarized as follows: The 
product of adenovirus (Ad) type 5 gene IX (pIX) is known to actively participate in the 

20 stability of the viral icosahedron, acting as a capsidic cement. It has previously been 
demonstrated that pIX is also a transcriptional activator of several viral and cellular TATA- 
containing promoters, likely contributing to the transactivation of the Ad expression program. 
As it is described in the following ,by extensive mutagenesis, the functional domains could be 
delinated that are involved in each of the pIX properties: residues 22 to 26 of the highly 

25 conserved N-terminal domain are crucial for capsidic incorporation of the protein; the C- 
terminal leucine-repeat is responsible for pIX interactions with itself and possibly other 
proteins, a property that is critical for pIX transcriptional activity. It could also be shown that 
pIX takes part in the viral-induced nuclear reorganization of late infected cells: through self- 
assembly, the protein induces the formation of specific nuclear structures which appear as 

30 dispersed nuclear globules by immunofluorescence staining and as clear amorphous spherical 
inclusions by electron microscopy. The integrity of the leucine-repeat is also essential for the 
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formation of these inclusions. Together, the results demonstrate the multifunctional nature of 
pIX and provide new insights into Ad biology. 

INTRODUCTION 

Replication-deficient adenoviruses (Ad) efficiently transfer and express candidate 
5 therapeutic genes into a variety of dividing and post-mitotic cell types. For these reasons such 
viruses constitute effective vectors for direct in vivo gene therapy. However, several 
drawbacks such as toxicity, host inflammatory responses or transient in vivo transgene 
expression impair the full success of Ad vectors in human gene therapy protocols. Multiple 
factors are involved, among which some viral proteins whose functions are often not fully 
10 understood. 

In the present invention, the attention focused on the the product of gene IX (pIX) 
from Ad serotypes 2 and 5 (Ad2 and Ad5). pIX is a small polypeptide of 140 residues (14.3 
kDa), that is incorporated into the mature viral capsid. It is associated with hexon proteins to 
form group-of-nine hexons (GON) that make up the central region of each facet of the 

15 icosahedron. Precise detemiination of the stoichiometry of this assembly has revealed that 
there are 12 molecules of pIX, organized as four trimers per GON, and therefore 240 
molecules per virion. The protein acts as a capsidic cement and thereby enhances the thermal 
stability of virions. It is essential for packaging 100 % and more of the full length Ad DNA. 
By themselves, these properties of pIX appear important enough to be taken into 

20 consideration during the design of Ad vectors. 

Additional observations strongly suggest that pIX is more than a capsidic protein and 
may serve additional functions during the infectious cycle: (i) gene IX is the only structural 
protein coding gene which is uncoupled from the Ad major late promoter (MLP); (ii) its 
expression pattern follows a different time course and begins at an intermediate time post- 
25 infection (pi), much earlier than that of all the other structural proteins; (iii) finally, pIX 
accumulates in the infected cell nuclei with a speckled distribution. In agreement with this 
nuclear localisation, it has previously been shown that pIX is a transcriptional activator of 
several viral and cellular TATA-containing promoters, among which the Ad El a, E4 and 
MLP promoters. This finding led to the hypothesis that pIX could be involved in the 
30 trans activation of Ad genome expression. 
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To precisely delineate the functional domains of pIX responsible for the structural and 
transcriptional properties, an extensive mutational analysis of the pIX coding sequence has 
been performed. The highly conserved N-terminal part of the protein could be shown to be 
essential for the capsidic structural properties, whereas the C-terminal leucine-repeat (putative 
5 coiled-coil domain) is critical for the transactivating function. Accumulation of pIX results in 
the formation of specific nuclear structures (the clear amorphous inclusions), the function of 
which is presently unknown. The present results suggest that formation of these structures 
involves self-assembly of pIX through its coiled-coil domain. 

MATERIALS AND METHODS 

10 Cells and viruses. 

Monolayer human A549 cells were grown in Dulbecco's medium supplemented with 
10% fetal calf serum (FCS). 293 cells were grown in Dulbecco's modified Eagle medium with 
2% FCS. A549 cells (at 80% confluence) were infected with wild type (wt) Ad2 or Ad5 at a 
multiplicity of infection (MOI) of 50 plaque forming units (PFU) per cell. Mutant viral 

15 genomes were constructed as infectious plasmids by homologous recombination in 
Esclierichia coli, as described. [ Chattier et al., 1996, J. Virol. 70, 4805-4810] All vectors 
v.:: contain, in addition to alterations of gene IX (see below), a deletion in El (between 
nucleotides 459 and 3331) and in E3 (between nucleotides 28592 and 30470) (Ad El° E3°). 
Nucleotide numbering throughout this paper conforms to that of Chroboczek et al. Mutant 

20 viruses were amplified in 293 cells. Viral growth, titration, and storage were previously 
described. [ Lusky et al., J. Virol. 72, 2022-2032] 

Recombinant eukaryotic expression vectors 

The sequence encoding wild type (wt) pIX was derived from the Ad5 genome by PCR 
amplification as previously described [ Lutz et al., 1997, J. Virol. 71, 5102-5109], and 

25 inserted into 3 types of expression vectors: (i) the pAT4 vector (gift from M. Vigneron), in a 
site located 3 f to the sequences encoding the F domain of the human estrogen receptor (hER), 
generating a wt pIX fusion protein tagged at its N-terminus (F/IX); (ii) the pXJ41 vector, 
which yields an untagged protein; (iii) the pXX vector, a pSG5-derived vector, where the 
sequence encoding wt pIX was inserted in a site located 5' to the sequences encoding the F 

30 domain of the human estrogen receptor, generating a pIX fusion protein tagged at its C- 
terminus (IX/F). In these vectors, the expression of the wt or mutated pIX sequences was 
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directed by the cytomegalovirus (CMV) enhancer and herpes simplex virus type-1 (HSV-1) 
thymidine kinase gene promoter (pAT4 and pXJ41), or the SV40 promoter (pSG5-derivative 
pXX). 

The Ad El a promoter sequence (positions +100 to +560, numbering to Chroboczek et 
5 al.) was subcloned in front of the chloramphenicol acetyl transferase (CAT) reporter gene of 
the promoterless pBLCAT6 vector, as previously described. [Lutz et al., 1997, J. Virol. 71, 
5102-5109 

Point mutations and deletions in the pIX coding sequence (as indicated in Fig. 2A) were 
generated by following the protocol of the "QuickChange site-directed mutagenesis" system 
10 (Stratagene catalog#2005 1 8). All plasmids were verified by sequencing. 

Oligonucleotides sequences : 

Deletion WA: 5' GACAACGCGCATGCCCCCA*GGGGTGCGTCAGAATGTG 3' (SEQ 
ID NO:2) 

and 3' CTGTTGCGCGTACGGGGGT*CCCCACGCAGTCTTACAC 5' (SEQ ID NO:3) 
15 * is deletion of 6 bases 5' TGGGCC 3* (3672-3677, with reference to Ad5) and 3' ACCCGG 
5* 

Deletion SYL: 5' 

GATGGAAGCATTGTGAGC*ACAACGCGCATGCCCCCATGGGCCGGGG 3' (SEQ ID 
NO:4) and 3' 

20 CTACCTTCGTAACACTCG*TGTTGCGCGTACGGGGGTACCCGGCCCC 5' (SEQ ID 
NO:5) 

* is deletion of 9 bases 5' TCATATTTG 3' (3645-3653) and 3' AGTATAAAC 5' 

mutation RQN: 5' CCCCCATGGGCCGGGGTGCTCGACGTGATGGGCTCCAGC 3' (SEQ 

ID NO:6) 

25 and 3' GGGGGTACCCGGCCCCACGAGCTGGTGATGGGCTCCAGC 5' (SEQ ID NO:7) 
mutationGSSIDGR: 

5' GGGCCGGGGTGCGTCAGAATGTGATGCATATGCCCGTCCTGCCCGC 3' (SEQ ID 
NO: 8) 

and 3' CCCGGCCCCACGCAGTCTTACACTACGTATACGGGCAGGACGGGCG 5' (SEQ 
30 ID NO:9) 

mutation L114P: 5' GGATTCTTTGACCCGGGAGCCCAATGTCGTTTCTCAGC 3' (SEQ 
ID NO: 10) 
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and 3' CCTAAGAAACTGGGCCTCGGGTTACAGCAAAGAGTCG 5' (SEQ ID NO: 11) 
mutation V117D: 5' GGGAACTTAATGTCGACTCTCAGCAGCTGTTGG 3' (SEQ ID 
NO: 12) 

and 3' CCCTTGAATTACAGCTGAGAGTCGTCGACAACC 5' (SEQ ID NO: 13) 
5 mutationL114P/V117D:5' 

GGATTCTTTGACCCGGGAGCCCAATGTCGACTCTCAGCAGCTGTTGG (SEQ ID 

NO: 14) 

3'and 3' CCTAAGAAACTGGGCCTCGGGTTACAGCTGAGAGTCGTCGACAACC 5' 
(SEQ ED NO: 15) 

10 mutation poly ala: 5' CGCCGTTGGAGACTCCATGGACCGCCCGCGGG 3' (SEQ ID 
NO: 16) and 

3" GCGGCA ACCTCTGAGGTACCTGGCGGGCGCCC 5' (SEQ ID NO: 17) 

mutation E113K: 5' GGATTCTTTGACCCGGAAACTTAATGTCGTTTCtCAGC 3' (SEQ 

ID NO: IS) and 

15 3' CCTAAGAAACTGGGCCTTTGAATTACAGCAAAGAGTCG 5' (SEQ ID NO: 19) 

mutation Q1016K: 5' GACGGCTCTTTTGGCAAAGCTTGATTCTTTGACCCGGG 3' (SEQ 
ED NO:20) and 

3* CTGCCGAGAAAACCGTTTCGTTCTAAGAAACTGGGCCC 3' (SEQ ID NO:21) 
Transfections, cell extracts and Western-blotting 

20 A549 cells were transfected by calcium phosphate coprecipitation. For CAT assays, 

the cells were harvested 36 h after transfection, extracts were prepared and aliquots, 
normalised by protein concentration, were assayed for CAT activity as described. [Bocco et 
al., 1993, Oncogene 8, 2977-2986] CAT activities were determined from at least 3 
independent experiments and quantitated with a Bioimaging analyzer (Fuji Photo F'lm Co.). 

25 For immunoprecipitations, the cells were harvested 36 h after transfection, by 3 cycles 

of freeze-thaw in buffer A (50 mM Tris-HCl, pH 7.9; 20% glycerol; 1 mM DTT; and 0.1% 
NP40) containing 0.4 M KC1. The expression of recombinant proteins was verified by 
Western blotting. After an additional clearing step on protein G-sepharose to adsorb non- 
specific binding proteins, cell extracts were incubated for 2 h with 1 jag of the anti-F antibody, 

30 after which 30 ul of protein G-sepharose beads were added and incubation was continued for 
an additional 2 h. The beads were then washed 3 times at room temperature with buffer A 
containing 200 mM KC1 and 0.5 % NP40 (mild-salt conditions). The resin was then 
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dissociated by boiling for 5-10 min in SDS sample buffer. The bound proteins were detected 
on Western blots with specific antibodies using the ECL system (Amersham), as previously 
described. [ Bocco et al., 1993, Oncogene 8, 2977-2986] Anti-pIX rabbit polyclonal 
antibodies were raised against purified recombinant GST-IX fusion protein (anti-pIX). 
5 Monoclonal antibodies against the F domain of the hER (Mab3A6) have been described. 
[Lute et al., 1997, J. Virol. 71, 5102-5109] 

Electron microscopy 

A549 cells near confluence were infected at a MOI of 5-10 PFUs of Ad5 per cell for 
30 min. Then, monolayers were rinsed with PBS, fresh medium was added and the cells were 

10 reincubated for 16 and 30 h before fixation. A549 cells were transfected with the vector . 
generating the untagged wt pIX and cultured for 36 h. The cells were fixed with 4% 
formaldehyde (Merck) in 0.1 M S6rensen f s phosphate buffer, pH 7.2, at 4-8°C, for 1 h. 
During the fixation step, the cells were scraped from their plastic substrate and centrifuged. 
The resulting pellets were dehydrated in increasing concentrations of methanol and embedded 

15 in Lowicryl K4M (Polysciences Europe GmbH, Germany). Polymerisation was performed at - 
30°C, for 5 days, under long wavelength UV light (Philips fluorescence tubes TL 6W) and 
subsequently at room temperature for 1 day. Ultrathin sections were collected on Foixnvar- 
carbon coated copper grids, mesh 200. 

For identifying structures containing the pIX viral protein, grids bearing Lowicryl 
20 sections were floated for 2 min over drops of Aurion BSA-C (purchased from Biovalley, 
France) (0.01 % in PBS) in order to prevent background, prior incubation (30 min at room 
temperature) in the presence of anti-pIX polyclonal antibody diluted 1/50 in PBS. After rapid 
washing in PBS, grids were incubated at room temperature, for 30 min in the presence of goat 
anti-rabbit IgG conjugated to gold particles, 10 nm in diameter (British Biocell International 
25 LTD, Cardiff, UK) and stained with uranyl acetate prior to observation with a Philips 400 
transmission electron microscope, at 80 kV and 6 000 to 22 000-fold magnification. To make 
sure that secondary antibody did not bind non-specifically to biological material, it was 
verified that no labelling occurred when primary antibody was omitted. 

Immunofluorescence. 

30 Immunofluorescence staining experiments were carried out as previously described. 

[Lutz et al., 1997, J. Virol. 71, 5102-5109] A549 cells were fixed with formaldehyde (2% 
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vol/vol in PBS) and permeabilised with 0.1 % Triton X-100 in PBS. The primary antibodies 
were diluted in PBS containing 0.1 % Triton X-100. The anti-pIX rabbit polyclonal antibody 
was used as described [Lutz et aL, 1997, J. Virol. 71, 5102-5109] and the Mab3A6 anti-F 
antibody was used at 1/5000 dilution in PBS containing 0.1% Triton X-100. After incubation 
5 for 1 h, the coverslips were washed several times in PBS-0.1% Triton X-100 and then 
incubated with goat Texas-Red-conjugated anti-rabbit IgG and/or donkey FITC-labelled anti- 
mouse IgG (Sigma), at concentrations recommended by the suppliers. Nuclei were counter- 
stained with Hoechst 33258. After staining, the coverslips were mounted and analysed using a 
confocal laser scanning microscope (Leica). Image enhancement software was used to 
10 balance signal strength and 8-fold scanning was used to separate signal from noise. 

Example 1: Peptide sequence and functional domains of pIX 

Multiple sequence alignments between pIX proteins from human and animal Ad 
serotypes (Fig. 1) revealed a high degree of identity (95%) over the entire length of pIX from 

15 serotypes belonging to the same subgenus. Although the extent of homology between 
serotypes from different species was lower, two conserved domains could be identified when 
comparing human and animal serotypes: referring to the coordinates of pIX residues (Aa) 
from the human Ad2 serotype, these domains locate at the N-terminal (Aa8-39) and C- 
terminal (AalOO-121) ends of pDC, respectively. An additional, alanine-rich domain, specific 

20 to the human serotypes, could be delineated (Aa60-69). 

No particular structural motif could be identified within the N-terminal domain. The 
alanine-rich stretch which is unlikely to adopt any particular structure and may serve as a 
flexible link between the two halves of the pDC molecule. By contrast, the C-terminal domain 
clearly revealed features of a leucine-repeat (or coiled-coil domain), as suggested by the 

25 helical wheel representation (Fig. 2B): ten non polar amino acid residues (leucine and valine), 
spaced every 3 and 4 residues, align at positions (a) and (d) on one side of the wheel; these 
residues presumably provide a hydrophobic interface to interact with similar residues, 
symmetrically positioned (a') and (d 1 ) on a second monomer (Fig. 2B), thereby potentially 
adopting a coiled-coil conformation (34, 35, 39, 40). Moreover, in the case of Ad2 and Ad5, 

30 ionised residues of opposite charge, located on either side of the helical wheel at positions (e) 
and (g), may further stabilise protein assembly by symmetrically interacting with 
corresponding residues at positions (e ! ) and (g ? ). 
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To determine the functional significance of these domains, the effects of a series of 
deletions or point mutations altering the conserved sequence elements were examined (Fig, 
2A). Within the N-terminal half of pIX, amino acid stretches that are most highly conserved 
between all (Aal3-15, Aa22-23, Aa26-28, Aa31-39) or only human Ad serotypes (Aa63-70) 
were deleted (delta). The C-terminal leucine-repeat was disrupted by changing, either 
separately or simultaneously, leucine 114 and valine 117 to proline and aspartate, 
respectively, to generate mutants L114P, V117D or L114P-V117D (L-V). Interruption of the 
apolar series at positions (a) and (d) by these residues were indeed expected to disturb the 
correct alignment (proline) or hydrophobic bonding (aspartate). Two mutated forms of pIX 
were also constructed in which the net charge at position (e) was inverted by exchanging 
Aal06 or Aall3 with a lysine residue (mutants Q106K and E113K, respectively), thereby 
triggering electrostatic repulsion between protein monomers. 

Example 2: The N-terminal part of pIX critically contributes to its capsidic incorporation 

Earlier immunoelectron microscopy studies with purified Ad virions and pIX-specific 
antisera revealed that only the C-terminal portion of pIX was accessible to the antibodies. The 
authors concluded that the C-terminal part of the protein was exposed at the surface of the 
virions, while its N-terminal domain was hidden inside the viral capsid. To further define the 
pIX elements involved in the capsidic assembly, the mutations described above were 
introduced into the El -deleted viral genome by homologous recombination to generate 
viruses expressing the pEX variants during viral production. Because all viruses are El- 
defective, they were produced on the 293 cells which constituvely express the El a and Elb 
Ad5 genes. Each pIX mutant virus was grown on 293 cells and viral particles were easily 
purified by density gradient centrifugation. After titration, 2.10 10 particules virions (v) were 
submitted to SDS-PAGE, in parallel with aliquots of the corresponding infected crude extracts 
(e) (Fig. 3). The presence of pIX was then examined by Westem-blot analysis, using anti-pIX 
polyclonal antibodies. As a positive control, the presence of the wt pIX protein was verified 
both in infected-cell extracts and purified Ad El° virions (Fig. 3, lanes 3 and 4), whereas it 
was absent in both fractions from cells infected with a Ad El ° lacking gene DC (Fig. 3, lanes 1 
and 2). 

Mutations altering pIX within its C-terminal part (L114P, V117D and L-V) did not 
prevent incorporation of the mutant protein into the capsid (Fig, 3, lanes 12, 14 and 16), 
indicating that the integrity of the leucine-repeat is not required for this function. Moreover, 
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capsid stability appeared not to be altered (data not shown). By contrast, deletions within the 
conserved N-terminal domain of pEX, like in mutants delta22-23 and delta 26-28, completely 
abolished incorporation of pIX into the capsid (Fig. 3, lanes 6 and 8), despite nearly normal 
levels of pIX synthesis (Fig. 3 lanes 5 and 7). Mutation (delta 13-15) did not impair virion 
5 insertion of pIX but the resulting capsid was less stable, as indicated by thermolability 
measurements (data not shown). 

Together, these results define the N-terminal region spanning Aa22-28 as crucial for 
the correct and stable recruitment of pEX into the viral capsid, whereas the central (data not 
shown) and C-terminal regions are not involved at all. 

10 Example 3: The integrity of the C-terminal leucine-repeat and central alanine- 

stretch of pIX are essential for its transcriptional activity 

It has previously been shown that pIX exhibits transcriptional properties. [Lutz et al., 
1997, J. Virol. 71, 5102-5109] Recombinant pIX efficiently stimulated, in a dose-dependent 
manner, the activity of several viral and cellular TATA-containing promoters. To precisely 

15 delineate the transactivating domain of pIX, the effect of the complete set of pIX mutations 
on Ela promoter activation was examined (Fig. 4). To this end, vectors expressing wild type 
or mutated pIX sequences as proteins fused to the F epitope tag (F/IX) were transfected 
together with a CAT reporter gene driven by the Ela promoter. After having verified that 
equal levels of pIX were expressed, as revealed by immunoblotting with antibodies against 

20 the F epitope (data not shown), CAT activities were measured. Under these conditions, 
relative CAT activities will reflect the intrinsic transcriptional activating capacity of each 
recombinant protein. 

In agreement with the earlier structural analysis, truncated versions of pIX lacking half 
of the leucine-repeat (F/Ixdelta 111-140) or most of the N-terminal half (F/IXdeltall-74) of 
25 the protein, lost their transactivating property (compare in Fig. 4B, columns 2 and 14 or 7 
respectively) 

Point mutations within the sequence encoding the leucine-repeat in the C-terminal end 
of pDC severely reduced reporter stimulation: single (L114P, V117D) or double alterations (L- 
V) had effects similar to complete deletion (deltal 11-140) of the C-terminal part of the 
30 protein (Fig. 4B, compare column 2 with 11-14). Point mutations Q106K and E113K also 
reduced transactivation (Fig. 4B, compare columns 2 and 9-10), stressing the contribution of 
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electrostatic interactions in functional assembly of pIX monomers. The transactivation 
function of pIX also depends on the integrity of the central domain, since deletion of the 
corresponding poly-alanine stretch (delta63-70) led to a strong reduction of reporter 
stimulation (Fig. 4B, compare columns 2 and 8). By contrast, deletions within the N-terminal 
5 part of pIX had no detectable effect on its intrinsic stimulatory activity, since very similar 
levels of transactivation were obtained with F/IX: delta 13-15, delta 22-23, delta 26-28, delta 
3 1-39 pIX variants and the wild type protein (Fig. 4B, compare columns 2 and 3-6). 

Together, the results described herein suggest that the pIX transactivating function is 
dependent on the integrity of the C-terminal leucine-repeat, as well as on the central alanine- 
10 rich element. Interestingly, the N-terminal region, critically involved in capsidic integration of 
pIX, is not involved in this function. 

Example 4: The integrity of the C-terminal leucine-repeat and central alanine- 
stretch of pIX are essential for its self-interaction 

The presence of a leucine-repeat type of structure at the C-terminal end of pIX 
15 suggests that the protein may dimerise (or multimerise) by interacting through this element. 
To test this possibility, a vector expressing the non-tagged wt pIX was co-transfected into 
A549 cells with vectors expressing F epitope-tagged wild type or mutant pIX proteins (F/EX). 
As revealed by Western-blot analysis of cell extracts with monoclonal anti-F or polyclonal 
anti-pIX antibodies, the transfected vectors were expressed to very similar levels (data not 
20 shown). 

When these extracts were immunoprecipitated with the monoclonal anti-F antibody 
under mild-salt conditions and submitted to SDS-PAGE, a clear band of non-tagged wt pIX 
protein was revealed with the anti-pIX antibody as indicated in Fig. 5, in addition to the F- 
tagged protein variants. As a control, the non-tagged pIX was expressed in the absence of F- 

25 tagged pIX and could not be detected in the anti-F immunoprecipitate (not shown). The 
results strongly suggest therefore that the non-tagged wt pIX was co-precipitated together 
with the F epitope-tagged wt pIX (F/IXwt, Fig. 5, lane 1). That this co-precipitation involved 
the leucine-repeat domain, was demonstrated by the observation that single point-mutations 
(E113K, Q106K, V117D) or a double point mutation (L-V) disrupting this structure reduced 

30 or abolished the carrying effect (Fig. 5, compare lane 1 with lanes 5-8). Similarly, a mutant 
lacking the C-terminal mid-part of the leucine-repeat (delta 111-140) did not co-precipitate wt 
pIX (data not shown). As expected, mutations within either the N-terminal domain (delta 22- 



WO 02/096939 PCT/EP02/05942 

62 

23 and delta 26-28, Fig. 5; or delta 13-15 and delta 31-39, not shown) had no effect on the 
interaction. By contrast, the central alanine-rich element (delta 63-70) of pIX was essential for 
co-precipitation (Fig. 5, compare lanes 1 and 4) and is therefore also involved in the 
oligomerisation process. Furthermore, the observation that the same mutations affect both 
5 transactivation and interaction properties of pIX suggests that the two activities may be 
linked. 

Example 5: pIX accumulates in virus-induced clear amorphous inclusions and induces 
their formation by itself. 

Previously, it has been shown by immunofluorescence staining with pIX-specific 

10 antibodies, that pIX was predominantly associated with infected cell nuclei, in accordance 
with the transcriptional properties of the protein. [Lutz et al., 1997, J. Virol. 71, 5102-5109] In 
addition, it could be observed that the nuclear staining of pEX was evolutive, showing a 
speckled distribution at later times of infection. To study the intranuclear distribution of pIX 
protein more precisely, Ad5-infected A549 cells were examined by immunoelectron 

15 microscopy at different times post-infection (pi), from 16 to 30 h pi, in order to observe the 
accumulation of pIX as a function of the successive steps of nuclear alteration during 
infection. These experiments demonstrated that protein pIX actively induces specific nuclear 
clear amorphous (c.a.) inclusions : Figure 7A represents Ad2-infected A549 cells at 
intermediate stage of nuclear transformation (14-16 h pi) processed for immunogold labelling 

20 with anti-pIX polyclonal antibody on lowicryl sections of formaldehyde-fixed cells. Gold 
particles are scattered over the fibrillo-granular network (fg), one component of the viral 
region, and accumulate over an enclosed small irregularly-shaped clear amorphous inclusion 
(c.a. inclusion; star). The accumulation site of viral single-stranded DNA (a), the other 
compartment of the viral region, is entirely devoid of pIX protein, c: cytoplasm. Bar, 0.5 jam. 

25 As shown in Figure 7B, later stages of Ad-mediated nuclear transformation (24-30 h pi) are 
characterised by the presence of progeny viruses. The roughly spherical c.a. inclusion (star) is 
intensely and homogeneously labelled It is located in the electron-translucent region (e) which 
separates the perinuclear layer of host chromatin (ch) from the large, centrally-located viral 
region (vr). Some viruses (v), both scattered in the electron-translucent region and clustered 

30 within the viral region are labelled. C : cytoplasm, Bar 0.5 |im. As shown in Figure 7C, 
overexpression of recombinant pIX protein induces the accumulation of the protein within 
newly-formed c.a. inclusions: A549 cells were transfected with the vector expressing the 
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untagged wild type pIX. Gold particles accumulate over the entire surface of an ovoid c.a. 
inclusion present in the nucleoplasm. Ti clearly appears that the labelled inclusion (star) is 
similar to those observed in (B) following adenovirus infection. 

No significant labelling was observed with the anti-pIX antibody before 16 h pi (not 
shown). At this time, the infected cells were mainly at the intermediate stage of nuclear 
alteration. Among several virus-induced structures (including sites of viral DNA replication, 
transcription and sites of viral genome or single-stranded viral DNA accumulation), pIX was 
detected in small, irregularly shaped or spherical, clear amorphous inclusions. At a later 
stage (24-30 h pi), when a central viral compartment and a perinuclear electron-translucent 
area with protein crystals and isolated viruses were apparent, pIX remained concentrated 
within the c.a. inclusions which became more frequent and spherical. Sometimes two or 
three of these inclusions were juxtaposed . In addition, pIX was observed over the 
crystalline arrays of viruses and the isolated viruses particles . 

Hence, since pIX protein is efficiently neosynthesized and belongs to the late phase 
of infection, the protein is predominantly associated to c.a. inclusions which are dynamic in 
their shape and location in the nucleus. The amount of pIX in the cell increases as the 
infection progresses, and the pIX protein accumulates in the nucleus. Irrespective of their 
shape, size and location within the nucleus, they were always intensely and homogeneously 
labelled with the anti-pIX antibody. 

The ultrastructural data suggest that pEX is the main component of the virus-induced 
c.a. inclusions. To determine whether pIX was directly responsible for their occurrence, it was 
attempted to express the protein in cells by transfecting with a wild type pIX expression 
vector, i.e. in the absence of any other viral protein. The morphology of pIX-expressing cells 
was similar to that of non-transfected cells except for the additional presence of c.a, 
inclusions in the nucleoplasm, identical to those observed in lytically infected cells . 
Depending on the amount of expressed pIX (i.e. as a function of time post-transfection), c.a. 
inclusions were variable in size and frequency, but always showed the same amorphous 
aspect (data not shown). Immunogold detection of pIX protein resulted in an intense labelling 
of each c.a. inclusion and in a slight labelling of the surrounding nucleoplasm and cytoplasm. 
Therefore, in the absence of other viral proteins, pEX is able to induce the formation of c.a. 
inclusions similar to those induced by Ad infection. 
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Specific immunofluorescence staining of cells bearing wt Ad5 or transfected with the 
wt pIX-expressing vector revealed a speckled distribution of the protein in the nucleus , most 
likely corresponding to the accumulation of pIX within the c.a. inclusions observed by 
5 electron microscopy. To identify peptidic domains of pIX which may be responsible of the 
formation of c.a. inclusions, the effect of the above-mentioned set of mutations (see Figure 
2A) on the nuclear distribution of pIX in transfected cells was examined. Mutations affecting 
either the N-terminal (delta 13-15, delta 22-23, delta 26-28, delta 31-39) or central (delta 63- 
70) domains of pIX did not affect the formation and nuclear location of the c.a. inclusions, 

10 since they all yielded the same speckled distribution (data not shown), as the wild type protein 
upon transfection. By contrast, alteration of the leucine-repeat of pIX by modification of the 
net charge of specific residues (mutations E113K or Q106K) drastically changed the 
intracellular distribution of the corresponding pIX variants which were then confined to the 
cytoplasm, as revealed by immunofluorescence staining . Furthermore, although the overall 

15 level of mutant expression was similar to that of the wild type protein (as revealed by Western 
blotting, data not shown), this variant accumulated with a micro-speckled pattern. 

Similarly, both point mutations (L114P, V117D, L-V) and deletions (delta 111-140), 
affecting the integrity of the leucine-repeat, abolished formation of the c.a. inclusions and 
resulted in a diffuse distribution of the altered pIX throughout the nucleus and cytoplasm . 

20 Fusion of the F epitope-tag, at the C-terminal end of wild type pIX (IX/F), also prevented the 
nuclear accumulation of the protein and induced the same diffuse pattern, despite the integrity 
of the leucine-repeat. This effect, most likely related to the steric hindrance imposed by the 
tag, suggests that free access to the leucine-repeat was essential for nuclear retention of pIX. 
Interestingly, when cells were cotransfected with vectors expressing the untagged (wt pIX) 

25 and C-terminally F-tagged (IX/F) proteins, both types of proteins accumulated within the 
same inclusion bodies, as revealed by the merged immunofluorescence staining . It appears 
therefore that the unfocused IX/F protein distribution was redirected by the coexpressed wt 
pIX into its corresponding nuclear accumulation sites. By contrast, when the F-tagged partner 
of the cotransfection carried an alteration within the leucine-repeat (IX/F:L-V), this 

30 recruitment was nearly abolished, as shown by the persistent diffuse distribution pattern of the 
F-tagged mutant . 
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Together, the results presented herein clearly indicate that the leucine-repeat of pIX is 
chiefly involved in the formation and localisation of the c.a. inclusions. Furthermore, the data 
strongly suggest that the assembly of the pIX-specific inclusion bodies is an active process 
driven by pIX itself, 

5 DISCUSSION 

Viruses, as obligatory cell parasites, usually evolved towards the highest possible 
degree of simplification of their structure and components to reach at minimal expense the 
most efficient rates of proliferation. Adenoviruses comply with this rule, not only in 
exploiting the coding capacity of their genome by using alternative reading frames, but also in 

10 producing proteins with multiple biological activities. The product of the Ad gene IX is an 
example of such multifunctional proteins: pIX (140 residues) is a structural component of the 
viral capsid, acts as a transcriptional activator and accumulates in infected cell nuclei as 
specific structures (c.a. inclusions), the function of which remains to be established. In the 
present study, an extensive series of site-directed mutagenesis of pIX was performed in order 

15 to define the corresponding functional domains of the protein. 

Structural involvement of pIX in viral capsid assembly 

The pIX protein has previously been described as a capsidic cement between the viral 
hexons, thereby optimising DNA packaging capacity and thermal stability of Ad virions. 
Based on immunochemical approaches, it had been suggested that the N-terminal portion of 

20 pIX sticks inside the capsid while its C-terminal part points outwards. The results shown 
herein indicate that residues 22-28 are essential for pIX incorporation into the capsid, virion 
thermostability and elevated viral production. The importance of these residues is further 
supported by their high degree of conservation among human and animal Ad serotypes. 
Additional residues within the N-terminal region of pIX likely contribute to this function, as 

25 suggested by the effect of the delta 13-15 deletion which did not impair pIX capsidic 
integration but affected virion stability. 

Interestingly, the present results rule out any contribution of the putative coiled-coil 
element of pIX to these capsid properties. This clearly indicates that the corresponding 
capsidic interactions occur through other elements than the coiled-coil domain and suggests 
30 that the N-terminal element defined above fulfils this function. Since coiled-coil elements 
have been shown to be responsible for the trimerisation or oligomerisation of other proteins, 
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like the well documented yeast GCN4 or HTV-1 gp41, it is likely that the coiled-coil element 
of pIX might also be involved in the trimerisation of this protein as it occurs in the GON 
complexes. This raises the question of whether the pIX molecules are still organised as 
trimers in the leucine-repeat mutants or whether the N-terminal domain of pIX, in close 
5 contact with the hexon molecules, is primarily responsible for the integration of pDC trimers 
into the virion capsid. Clearly, additional experiments, including co-immunoprecipitation 
assays, three-dimensional structure analyses will be required to solve these questions. 

Transcriptional activity of pIX 

It has previously been reported that recombinant pIX exhibits properties of a 
10 transcriptional activator when assayed in transfection experiments or in a reconstituted in 
vitro transcription system. By the present data, it was shown that the integrity of the leucine- 
repeat element of pIX is mandatory for this transactivatirig function, while the highly 
conserved residues within the N-terminal half of the molecule are not associated to this 
activity. Larger deletions in the N-terminal region (11-74) or just removing the alanine-rich 
15 element (63-70) also impaired pIX transcriptional activity, despite preserving the coiled-coil 
domain. As suggested by predictive structure analyses (data not shown), the poly-alanine 
stretch may serve a hinge function in the pIX molecule. It is therefore possible, that mutations 
to this element disrupt the global structure of pIX and thereby affect its essential functions. 

Additional evidence (M.R.-C. et al., in preparation) revealed that, soon after cell entry 
20 and virion decapsidation, the released capsidic pIX accumulates in the cell nucleus, as soon as 
45 min pi until 6 h pi. These pIX molecules may then activate the highly responsive El a 
promoter, thus behaving as a ready-to-use transactivator, much like the VP 16 factor in the 
case of HSV-1 infection. Furthermore, it is likely that the newly synthesised pIX, which starts 
accumulating at intermediate times pi, also contributes to the activation of the late phase of 
25 Ad infection by stimulating MLP activity. Thus, it will be of interest to decipher the 
molecular mechanism of pDC-mediated transactivation. 

As expected from the absence of basic residues flanking its leucine-repeat (to make up 
a bona fide basic-leucine-zipper) and from the lack of any other DNA recognising motif, pIX 
has no DNA-binding activity (data not shown). By contrast, as revealed by co- 
30 immunoprecipitation experiments, the leucine-repeat is involved in the interaction of pIX with 
itself, allowing its homo-dimerisation or -oligomerisation via a coiled-coil structure. The fact 
that the same mutations affect both pIX self-interaction and transactivation properties 
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suggests that the two activities are directly correlated. Alternatively, or in addition, pIX might 
interact through this leucine-repeat element with components of the transcription apparatus: 
preliminary results indeed suggest that pIX contacts specific RNA polymerase II subunits and 
general transcription factors, thus mimicking other viral transactivators like Ad El a, HSV-1 
pX, or VP16. 

Interestingly, one feature that is shared by all pIX-responsive promoters assayed so 
far, whether from viral or cellular origin, is the presence of a canonical TATA box. It will be 
of interest to identify the mechanism of this specific promoter targeting. It is tempting to 
speculate that such a promoter preference for pIX is of some advantage for Ad propagation 
since all viral promoters, with the exception of the E2 promoter, contain a TATA box. This 
hypothesis is further supported by the fact that only episomal genes (i.e. either in plasmidic or 
viral form), but not chromosome-integrated reporters, have been found to be responsive to 
pIX. 

Nuclear accumulation structures of pIX 

Despite the absence of any detectable homology with consensus nuclear localisation 
signals (NLS) in its peptide sequence, pIX concentrates within the cell nucleus. hHerein, it 
has been shown that the leucine-repeat of pIX is essential for this nuclear accumulation, as 
well as for the formation of the c.a. inclusions. Together, these observations indicate that pIX, 
by virtue of its low molecular weight (M.W.), freely diffuses from the cytoplasm to the 
nucleus where it is retained, most likely through interactions involving the leucine-repeat and 
specific nuclear components. Preliminary biochemical evidence suggest that pIX actually 
associates with fractions of the nuclear matrix . It is therefore likely that the targeting of the 
nuclear matrix by pEX constitutes the initial step in the assembly of the c.a. inclusions, 
providing a nucleation point for pEX oligomerisation. Since these nuclear structures are built 
up in the absence of any other viral protein except pIX, it is to be concluded that they reflect 
an intrinsic property of pIX. 

The leucine-repeat plays a central role in both pIX transcriptional activity and ability 
to form c.a. inclusions. However several lines of evidence clearly indicate that these functions 
are two independent properties of pIX: (i) during the late phase of infection, c.a. inclusions 
were always found excluded from the viral transcription sites; (ii) neither RNA polymerase II 
nor primary transcripts could be detected within the c.a. inclusions from Ad-infected cells or 
cells transfected with wild type pIX expressing vectors (data not shown); (iii) a mutant 
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lacking part of its alanine-rich element (delta 63-70) lost its capacity to transactivate the Ela 
promoter (Fig. 4) and to self-associate under immuno-precipitation conditions (Fig. 5), but 
retained its ability to accumulate into nuclear c.a. inclusions (data not shown); (iv) conversly, 
fusion of the F epitope to the C-terminal end of pIX (IX/F), completely abolished the ability 

5 of the protein to form c.a. inclusions in the nucleus > but did not affect its transcriptional 
properties (Fig. 4, lane 15), unless the structural integrity of the leucine-repeat was altered 
(Fig. 4, lane 16). The most simple interpretation to account for the apparent unrelatedness of 
these pIX functions, is that pIX might exert its transcriptional properties only at low 
concentrations (i.e. at initial times of infection and just after the onset of pIX synthesis) while 

10 it starts forming nuclear inclusions at higher concentrations by accumulating on nuclear 
matrix structures, as observed later in infection or after transfection. The finding that addition 
of the F-tag at the C-terminal end of pIX differentially affects pIX transcriptional activity and 
c.a. inclusion formation, may therefore merely reflect the preferential impairment of contacts 
implicated in the assembly of c.a. inclusions, compared to those required for promoter 

15 transactivation. 

The physiological role of these c.a. inclusions remains puzzling. The possibility that 
they represent viral protein storage sites, serving as a source of readily mobilisable proteins 
for subsequent virus assembly, seems unlikely for the following reasons: (i) their formation is 
induced exclusively by pIX, whose synthesis is disconnected from that of all the other virion 
20 proteins, in Ad-infected cells; (ii) although low amounts of hexon can be observed in these 
structures at late times pi, no penton and fiber proteins can be detected. 

Similar globular structures have been described in cells infected with HSV-1 where 
virus-induced "finely granular translucent patches" were observed late in infection. These 
structures have been shown to contain viral capsid proteins and, in addition, to concentrate 

25 specific cellular proteins like the promyelocytic leukaemia (PML) and SP100 proteins which 
are diverted from the so-called PML oncogenic domains (PODs) that are the normal PML and 
SP100 accumulation sites. The detection of PML and SP100 proteins in the c.a. inclusions of 
Ad-infected cells further stresses the similarity between the two virus-induced structures. This 
observation also raises the interesting possibility that pIX interferes with the function of PML 

30 and related proteins, thus relaying, during later times in infection, the effect of the Ad early 
E4orf3 product which has also been shown to delocalise PML from the PODs. We are 
currently exploring these hypotheses. 
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It appears therefore that pIX plays multiple functions during infection. Interestingly, it 
shares some of these functions with the product of the other Ad intermediate gene, pIVa2, 
both proteins are transcriptional activators, take part in the viral-induced alterations of the 
host cell by accumulating as specific nuclear inclusions and are present in the mature virion 
particles. No doubt that these multifaceted viral entities have yet additional secrets to reveal 
and thus clearly deserve attention when designing Ad-based vectors for gene therapy 
protocols. 

Example 6 : Insertion of a polylysine binding moiety in the pIX protein 

The human embryonic kidney 293 cell line (ATCC, Rockville, MD, USA) and the 
CHO cell line (ATCC ; CCL-61) were grown at 37°C in DMEM supplemented with 10 % 
Fetal Calf Serum. 

Construction of pLK-modified viral genomes 

pEX coding sequence was mutated as described above using the QuickChange site- 
directed mutagenesis system (Stratagene), to introduce mutations in pIX coding sequence: 
L114P, V117D and the double mutation L114P-V117D, respectively. 

Introduction of 7K and Gly-Ser-(Ser-Ala) 4 -Gly-Ser-{K) 7 peptides between Leul31 and 
Lysl32 residues within C terminal part ofpIX 

The method using the QuickChange site-directed mutagenesis system (Stratagene) 
allowed to introduce the restriction site BamHI between the base codons encoding Leul31 
and Lysl32 within the pIX coding sequence. In this case, the following sense and antisense 
oligonucleotides were used : 5'-cgc cag cag gtt tct gcc ctg gga tec- aag get tec tec cct ccc aat 
gcg g3' (SEQ ID NO : 22) and 5'-c cgc att ggg agg gga gga age ctt gga tec cag ggc aga aac 
ctg ctg gcg-3' (SEQ ID NO : 23). 

The oligonucleotides 5'- ga tec aag aag aag aag aag aag aag taa g-3' (SEQ ID NO : 
24) and 5'- ga tec tta ctt ctt ctt ctt ctt ctt ctt tga tec cgc cga cgc gga ggc gga gcc get g-3' (SEQ 
ID NO : 25) z were hybridized to each other and cloned into the BamHI site to directly 
introduce the 7K peptide within the C -terminal part of pIX. Alternatively, the 
oligonucleotides 5'- ga tec age gcc tec gcc tec gcg teg gcg gga tea aag aag aag aag aag aag 
aag taa g-3' (SEQ ID NO : 26) and 5'- ga tec tta ctt ctt ctt ctt ctt ctt ctt tga tec cgc cga cgc gga 
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ggc gga gcc get gga tec g-3' (SEQ ID NO : 27) were hybridized to each other and cloned into 
the BamHI site to directly introduce the 7K binding moiety connected to the pIX sequence via 
the spacer Gly-Ser-(Ser-Ala>4-Gly-Ser. 

5 Introduction of the 7K peptide between Q127 and Vall28 residues within the C- 

teiminal part ofpIX 

Alternatively, the QuickChange site-directed mutagenesis system (Stratagene) was 
also used to introduce in two steps the 7K peptide between the residues Q127 and V128 
within the C-terminal part of pIX. Following sense and antisense oligonucleotides were used 

10 in the first step : 5'-g ctg ttg gat ctg cgc cag cag aag aag aag aga tct gcc ctg aag get tec tec cct 
ccc aat gcg g-3' (SEQ ID NO : 28) and 5'-c cgc att ggg agg gga gga age ctt cag ggc aga tct ctt 
ctt ctt ctg ctg gcg cag ate caa cag c-3' (SEQ ID NO : 29), and then in the second step, the 
following oligonucleotides : 5'-g ctg ttg gat ctg cgc cag cag aag aag aag aag aaa aaa aaa taa tct 
gcc ctg aag get tec tec cct ccc aat gcg g-3' (SEQ ID NO : 30) and 5'- c cgc att ggg agg gga 

15 gga age ctt cag ggc aga tta ttt ttt ttt ctt ctt ctt ctt ctg ctg gcg cag ate caa cag c-3' (SEQ ID NO : 

? 31). 

- ^ The amino acid sequence of the modified pIX protein can be read as follows starting 

from L in position 100 (see SEQ ID NO:32 and 33, respectively): 

LTALLAQLDSLTREP(1 14)NVD(1 17)SQQLIJDLRQQKKKKKKK (7K binding moiety) 

20 or 

LTALLAQLDSLTREP(1 14)NVD(1 17)SOOLLDLROOVSAL KGS(SA) 4 S KKiagEgKK 
(spacer / 7K moiety). 

All vectors contain, in addition to alterations of gene pIX, a deletion in El and in E3 
and comprise in replacement of El the LacZ gene (encoding beta galactosidase) driven by the 
25 CMV promoter. 

The infection efficiency of the 7K-containing pIX constructs for a number of tumoral 
cells was evaluated in comparison to the conventional targeted adenovirus comprising the 7K 
binding moiety comprised in the capsid fiber (Ad Fb 7K described by Leissner et al (2001, 
Gene therapy 8, 49-57). 

30 

Cell lines infected by 7K modified Ad 



OCID: <WO_02096939A2_I_> 



\ 

WO 02/096939 PCT/EP02/05942 

71 

The human adenocarcinoma prostatic PC3 (ATCC CRL-1435) and LN-CAP (ATCC 
CRL-10955) cell lines, the human squamous carcinoa from head and neck SqCCYl (Dr. 
Reuben Lotan Univ Texas, Houston, USA) cell line, and the human breast cancer T47D 
(ATCC HTB-133), H3396 (BMS) and SKBR3 cell lines were grown at 37°C in DMEM or 
5 RPMI supplemented with 10 % Fetal Calf Serum and antibiotics. 

Conditions of infection 

Viruses in the range of concentrations 10, 100 and 1000 particles / cell were added to 
target cell monolayers for 1 hour at 4°C. The inoculum was removed and the cells were 

10 washed twice with cold medium before they were incubated for 48h at 37°C. Cells were then 
fixed and stained for determining (3-galactosidase activity, as previously described by P. 
Leissner et al. (2001, Gene Therapy 8, 49-57). Infected cells were then counted. Alternatively, 
the |3-galactosidase activity of whole cell lysate was monitored using a chemiluminescent 
substrate (luminescent fi-galactosidase detection kit, Clontech, Palo Alto), as also described 

15 by P. Leissner et al. (2001, Gene Therapy 8, 49-57). 

For all tumor cell lines tested including human SKBR3, T47D, H3326 , A549 cells 
and murine Renca, B16F0, the infectivity of Ad-pIX 7K was superior by 10 to 100 fold 
compared to the other vectors and especially the conventional targeting vector Ad-Fb-7K 
comprising the 7K binding moiety inserted at the C-terminus of the fiber. 

20 

In these studies, it was demonstrated that peptide-binding moieties can be inserted 
within the C-terminal part of the Ad pIX protein. The superior infection of Ad-pIX 7K into a 
variety of human and murine tumor cells is consistent with the fact that many tumor cells 
have an increased expression of heparan sulfate proteoglycans on their surface. Therefore, 
25 retargeting the vector with tumor-specific ligands using die pIX protein will expand the 
applicability of adenoviral vectors for cancer gene therapy. 
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Claims 

1. An adenovirus pIX protein modified by mutation of one or more amino acids of said 
pIX protein as compared to the corresponding wild-type pIX protein and/or so as to 
comprise a binding moiety, wherein the presence of said modified pIX protein in a 
virus or virus-like particle results in an improved gene delivery efficiency in a target 
cell of said virus or virus-like particle as compared to the gene delivery efficiency of a 
corresponding virus or virus-like particle containing said corresponding wild-type pIX 
protein. 

2. The adenovirus pIX protein of claim 1, wherein said mutation results in the presence 
of a binding moiety in the pIX protein. 

3. The adenovirus pIX protein of claim 1 or 2, wherein said binding moiety is capable to 
bind to a target cell. 

4. The adenovirus pIX protein of any one of claims 1 to 3, wherein said amino acids to 
be mutated are selected in the N-terminal part of the protein. 

5. The adenovirus pIX protein of any one of claims 1 to 3, wherein said amino acids to 
be mutated are selected in the C-terminal part of the protein. . 

6. The adenovirus pIX protein of claim 5, wherein said amino acids to be mutated are 
selected in the C-terminal leucine -repeat of the protein. 

7. The adenovirus pIX protein of claim 6, which shows a mutation selected from the 
group consisting of 

(a) a substitution of the leucine residue at a position corresponding to position 114 
of SEQ ID NO: 1 by a proline residue; 

(b) a substitutions of the valine residue at a position corresponding to position 117 
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(c) substitution of the leucine residue and the valine residue at positions 
corresponding to position 114 and 117, respectively, of SEQ ID NO:l by a 
proline and an aspartic acid residue, respectively. 

8. The adenovirus pIX protein of any one of claims 1 to 7 having a binding moiety which 
is polylysine. 

9. The adenovirus pEX protein of claim 8 comprising the amino acid sequence of SEQ ID 
NO:32 or 33. 

10. A nucleic acid molecule comprising a nucleotide sequence encoding the adenovirus 
pIX protein of any one of claims 1 to 9. 

11. An adenoviral vector comprising the nucleic acid molecule of claim 10. 

12. A method for producing a virus or virus-like particle comprising the steps of 

a. transforming a suitable host cell with an adenoviral vector according to claim 
11; 

b. culturing the transformed cell line under conditions suitable io allow formation 
of a virus or virus-like particle from said adenoviral vector; and 

c. recovering the virus or virus-like particle formed in step (b) from the culture. 

13. A method for producing a virus or vims-like particle comprising the steps of 

a. transforming a suitable host cell with an adenoviral vector which encodes a 
wild-type pIX protein; 

b. modifying the coding sequence for the pIX protein in the adenoviral vector so 
that the encoded pIX protein is one according to any one of claims 1 to 9; 

c. culturing the host cell under conditions suitable to allow fdrmatioii of a virus 
or virus-like particle from the adenoviral vector of step (b); and 

d. recovering the virus or virus-like particle formed in step (c) from the culture. 
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14. A virus or virus-like particle comprising the adenovirus pIX protein of any one of 
claims 1 to 9, the nucleic acid molecule of claim 10 or the adenoviral vector of claim 
11 or obtainable by the method of claim 12 or 13. 

5 15. The virus or virus-like particle of claim 14 which is substantially incapable of binding 
its host cell. 

16. A eukaryotic host cell comprising the adenovirus pIX protein of any one of claims 1 
to 9, the nucleic acid molecule of claim 10 or the adenoviral vector of claim 11 or 

10 being infected with or comprising the virus or virus-like particle of claim 14 or 15. 

17. A complementation cell line suitable for producing the virus or virus-like particle of 
claim 14 or 15 or for applying the method of claim 12 or 13. 

15 18. A pharmaceutical composition comprising the adenoviral vector of claim 11, the virus 
or virus-like particle of claim 14 or 15, the host cell of claim 16 or the 
complementation cell line of claim 17, wherein said adenoviral vector, virus or vims- 
like particle host cell or complementation cell line is capable of expressing a 
therapeutically useful gene, and optionally a pharmaceutical^ acceptable carrier. 

20 

19. Use of the adenoviral vector of claim 11, the virus or virus-like particle of claim 14 or 
15, the host cell of claim 16 or the complementation cell line of claim 17, wherein said 
adenoviral vector, virus or virus-like particle, host cell or complementation cell line is 
capable of expressing a therapeutically useful gene, for the preparation of a 
25 pharmaceutical composition for gene therapy in a patient in need for such therapy. 
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FIGURE 6(1/30) 

- - CATCATCAAT AATATACCTT ATTTTGGATT GAAGCCAATA TGATAATGAG GG 

5 ' #GGTGGAGT 60 

- - TTGTGACGTG GCGCGGGGCG TGGGAACGGG GCGGGTGACG TAGTAGTGTG 

GC - 

#GGAAGTGT 120 

- - GATGTTGCAA GTGTGGCGGA ACACATGTAA GCGACGGATG TGGCAAAAGT 

10 GA - 

#CGTTTTTG 180 

- - GTGTGCGCCG GTGTACACAG GAAGTGACAA TTTTCGCGCG GTTTTAGGCG 

GA - 

#TGTTGTAG 240 " 

15 - - TAAATTTGGG CGTAACCGAG TAAGATTTGG CCATTTTCGC GGGAAAACTG 

AA - . . 

#TAAGAGGA 3 00 

- - AGTGAAATCT GAATAATTTT GTGTTACTCA TAGCGCGTAA TATTTGTCTA 

GG - 

iO #GCCGCGGG ' 3 60 

- - G ACTTTG AC C . GTTTACGTGG AGACTCGCCC AGGTGTTTTT CTCAGGTGTT 
TT - . 

#CCGCGTTC 420 

- - CGGGTCAAAG TTGGCGTTTT ATTATTATAG TCAGCTGACG TGTAGTGTAT 

25 TT - 

' 1 #ATACCCGG 48 0 * 

- - TGAGTTCCTC AAGAGGCCAC TCTTGAGTGC CAGCGAGTAG AGTTTTCTCC 

TC - 

#CGAGCCGC 540 

30 - - TCCGACACCG GGACTGAAAA TGAGACATAT TATCTGCCAC GGAGGTGTTA 

TT - 

#ACCGAAGA 600 

- - AATGGCCGCC AGTCTTTTGG ACCAGCTGAT CGAAGAGGTA CTGGCTGATA 

AT - 

35 "#CTTCCACC 6 60 

- - TCCTAGCCAT TTTGAAC C AC CTACCCTTCA CGAACTGTAT GATTTAGACG 

TG - 

#ACGGCCCC 72 0 

- - CGAAGATCCC AACGAGGAGG CGGTTTCGCA GATTTTTCCC GACTCTGTAA 

P TG - 

. #TTGGCGGT 780 

- - GCAGGAAGGG ATTGACTTAC TCACTTTTCC GCCGGCGCCC GGTTCTCCGG 

AG - 

#CCGCCTCA 840 

45 - - CCTTTCCCGG CAGCCCGAGC AGCCGGAGCA GAGAGCCTTG GGTCCGGTTT 

CT - 

#ATGCCAAA 900 

- - CCTTGTACCG GAGGTGATCG ATCTTACCTG CCACGAGGCT GGCTTTCCAC 

CC - 

50 #AGTGACGA 9 60* 

- . - CGAGGATGAA* GAGGGTGAGG AGTTTGTGTT AGATTATGTG GAGCACCCCG 

GG - 

#CACGGTTG 102 0 

- - CAGGTCTTGT C ATTATC AC C GGAGGAATAC GGGGGACCCA GATATTATGT 

55 GT - 

#TCGCTTTG 108 0 

- - CTATATGAGG ACCTGTGGCA TGTTTGTCTA CAGTAAGTGA AAATTATGGG 

CA - . 
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SGTGGGTGA 1140 

- . - TAGAGTGGTG GGTTTGGTGT GGTAATTTTT TTTTTAATTT TTACAGTTTT 

#GGTTTAAA 12 00 

- - GAATTTTGTA" TTGTGATTTT . TTTAAAAGGT CCTGTGTCTG AACCTGAGCC 

#AGCCCGAG' 1260 

- - CCAGAACCGG AGCCTGCAAG ACCTACCCGC CGTCCTAAAA TGGCGCCTGC 

#TCCTGAGA 132 0 

- - CGCCCGACAT . CACCTGTGTC TAGAGAATGC AATAGTAGTA CGGATAGCTG 

#ACTCCGGT 13 8 0 

15 - " CCTTCTAACA CACCTCCTGA GATACACCCG GTGGTCCCGC TGTGCCCCAT 

TA - 

#AACCAGTT 1440 

- - GCCGTGAGAG TTGGTGGGCG TCGCCAGGCT GTGGAATGTA TCGAGGACTT 

GC - 

20 #TTAACGAG 1500 

- - CCTGGGCAAC CTTTGGACTT GAGCTGTAAA CGCCCCAGGC CATAAGGTGT 

#ACCTGTGA 1560 

- TTGCGTGTGT. GGTTAACGCC TTTGTTTGCT GAATGAGTTG ATGTAAGTTT 

#TAAAGGGT 162 0 

~ " GAGATAATGT TTAACTTGCA TGGCGTGTTA AATGGGGCGG GGCTTAAAGG 

#ATATAATG 168 0 

- - CGCCGTGGGC TAATCTTGGT TACATC TG AC CTCATGGAGG CTTGGGAGTG 

#TGGAAGAT 1740 

- - TTTTCTGCTG TGCGTAACTT GCTGGAACAG AGCTCTAACA GTACCTCTTG 

#TTTGGAGG 18 00 

- - TTTCTGTGGG GCTCATCCCA GGCAAAGTTA . GTCTGCAGAA TTAAGGAGGA 

#ACAAGTGG 18 60 

- - GAATTTGAAG AGCTTTTGAA ATCCTGTGGT GAGCTGTTTG ATTCTTTGAA 

#TGGGTCAC 192 0 

- - CAGGCGCTTT TCCAAGAGAA GGTCATCAAG ACTTTGGATT- TTTCCACACC 



AA 
25 ' AA 
GT 

30 

TT 
GT 

35 

• TT 
40 TC 

y' 

GG 



#GGCGCGCT 198 0 

45 - - - GCGGCTGCTG TTGCTTTTTT GAGTTTTATA AAGGATAAAT GGAGCGAAGA 

AA - 

#CCCATCTG 2 04 0 

- - AGCGGGGGGT ACCTGCTGGA TTTTCTGGCC ATGCATCTGT GGAGAGCGGT 

TG - 

50 #TGAGACAC 2100 

- - AAGAATCGCC TGCTACTGTT GTCTTCCGTC CGCCCGGCGA TAATACCGAC 

#AGGAGCAG 216 0 

- - CAGCAGCAGC AGGAGGAAGC CAGGCGGCGG CGGCAGGAGC AGAGCCCATG 

ttACCCGAGA 2 22 0 

- - GCCGGCCTGG ACCCTCGGGA atgaatgttg TACAGGTGGC TGAACTGTAT 

#AGAACTGA 22 8 0 

- - GACGCATTTT GACAATTACA GAGGATGGGC AGGGGCTAAA GGGGGTAAAG 

#GGAGCGGG 23 4 0 



GG 

55 . GA 
CC 

60 

AG 
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. - - GGGCTTGTGA GGCTACAGAG GAGGCTAGGA ATCTAGCTTT TAGCTTAATG AC - 
#CAGACACC 2400 

- - GTCCTGAGTG TATTACTTTT CAACAGATCA AGGATAATTG CGCTAATGAG . 
5 CT - 4 

' #TGATCTGC 2460 

- - TGGCGCAGAA GTATTC CAT A GAGCAGCTGA CCACTTACTG GCTGCAGCCA 
GG - • 

#GGATGATT 2 52 0 

10 - - TTGAGGAGGC TATTAGGGTA TATGCAAAGG TGGCACTTAG GCCAGATTGC 

AA - 

#GTACAAGA 2580 

- - TCAGCAAACT TGTAAATATC AGGAATTGTT GCTACATTTC TGGGAACGGG 

GC - 

15 #CGAGGTGG 2640 

- - AGATAGATAC GGAGGATAGG GTGGCCTTTA GATGTAGCAT GATAAATATG 

TG - 

#GCCGGGGG 27 00 

- - TGCTTGGCAT GGACGGGGTG GTTATTATGA ATGTAAGGTT TACTGGCCCC 

20 AA - 

#TTTTAGCG - 2760 

• - - GTACGGTTTT CCTGGCCAAT ACCAACCTTA TCCTACACGG TGTAAGCTTC 
TA - . 
#TGGGTTTA 2 82 0 

25 -* - ACAATACCTG TGTGGAAGCC TGGACCGATG TAAGGGTTCG GGGCTGTGCC 

TT - 

#TTACTGCT 2880 

- - GCTGGAAGGG GGTGGTGTGT CGCCCCAAAA GCAGGGCTTC AATTAAGAAA . 

TG - 

-30 * ^CCTCTTTG 2940 

- - AAAGGTGTAC CTTGGGTATC CTGTCTGAGG GTAACTCCAG GGTGCGC C AC 

AA - 

#TGTGGCCT 3 000 

- - CCGACTGTGG TTGCTTCATG CTAGTGAAAA GCGTGGCTGT GATTAAGCAT 

35 AA - 

#CATGGTAT 3060 

- - GTGGCAACTG CGAGGACAGG GCCTCTCAGA TGCTGACCTG CTCGGACGGC ■ 

AA - 

#CTGTCACC 312 0- 

40 - - " TGCTGAAGAC CATTCACGTA GCCAGCCACT CTCGCAAGGC CTGGCCAGTG 

TT - 

#TGAGCATA 318 0 

- - ACATACTGAC CCGCTGTTCC TTGCATTTGG GTAACAGGAG GGGGGTGTTC 
CT - . 

45 #ACCTTACC 3240 

- - AATGCAATTT GAGTCACACT AAGATATTGC TTGAGCCCGA GAGCATGTCC 

AA - 

#GGTGAACC 3300 

- - TGAACGGGGT GTTTGACATG ACCATGAAGA TCTGGAAGGT GCTGAGGTAC 

50 GA - 

#TGAGACCC 33 60 

- - GCACCAGGTG CAGACCCTGC GAGTGTGGCG GTAAACATAT TAGGAACCAG 

CC - 

#TGTGATGC 342 0 

55 - - TGGATGTGAC CGAGGAGCTG AGGCCC GATC ACTTGGTGCT GGCCTGCACC 

CG - 

#CGCTGAGT 3 48 0 

- - TTGGCTCTAG C G ATGAAG AT ACAGATTGAG GTACTGAAAT GTGTGGGCGT 

GG - 

60 #CTTAAGGG 3 540 

- - TGGGAAAGAA TATATAAGGT GGGGGTCTTA TGTAGTTTTG TATCTGTTTT 

GC - 
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#AGCAGCCG 3 600 

- - CCGCCGCCAT GAGCACCA&C TCGTTTGATG GAAGCATTGT GAGCTCATAT 

TT - 

5 #GACAACGC 3 660 

- - GCATGCCCCC ATGGGCCGGG GTGCGTCAGA ATGTGATGGG CTCCAGCATT 

GA - 

#TGGTCGCC 372 0 

- - CCGTCCTGCC CGCAAACTCT ACTACCTTGA CCTACGAGAC CGTGTCTGGA 

10 AC - 

#GCCGTTGG • *3780 

- AGACTGCAGC CTCCGCCGCC GCTTCAGCCG CTGCAGCCAC CGCCCGCGGG 

AT - 

#TGTGACTG 3 840 

15 - - ACTTTGCTTT CCTGAGCCCG CTTGCAAGCA GTGCAGCTTC CCGTTCATCC 

GC - 

#CCGCGATG 3 9 00 

- - ACAAGTTGAC GGCTCTTTTG GCACAATTGG ATTCTTTGAC CCGGGAACTT 

AA - 

20 #TGTCGTTT 3 9 60 

- - CTCAGCAGCT GTTGGATCTG CGCCAGCAGG TTTCTGCCCT GAAGGCTTCC 

TC - 

#CCCTCCCA 4020 

- - ATGCGGTTTA AAACATAAAT AAAAAACCAG ACTCTGTTTG GATTTGGATC 

25 AA - 

#GCAAGTGT 4080 

- - CTTGCTGTCT .TTATTTAGGG GTTTTGCGCG CGCGGTAGGC CCGGGACCAG 

. CG - 

#GTCTCGGT 4140 

30 - - CGTTGAGGGT CCTGTGTATT TTTTCCAGGA CGTGGTAAAG GTGACTCTGG 

AT - 

#GTTCAGAT 42 00 

- - ACATGGGCAT AAGCCCGTCT CTGGGGTGGA GGTAGCACCA CTGCAGAGCT 

TC - 

35 #ATGCTGCG 42 60 

- - GGGTGGTGTT GTAGATGATC CAGTCGTAGC AGGAGCGCTG GGCGTGGTGC 

CT - 

#AAAAATGT 4320 

- - CTTTCAGTAG CAAGCTGATT GCCAGGGGCA GGCCCTTGGT GTAAGTGTTT 

40 AC - 

#AAAGCGGT 43 80 

- - TAAGCTGGGA TGGGTGCATA CGTGGGGATA TGAGATGCAT* CTTGGACTGT 

AT - 

#TTTTAGGT 4440 

45 - - TGGCTATGTT CCCAGCCATA TCCCTCCGGG GATTCATGTT GTGCAGAACC 

AC - 

#CAGCACAG 45 00* - - 

- - TGTATCCGGT GCACTTGGGA AATTTGTCAT GTAGCTTAGA AGGAAATGCG 

TG - 

50 #GAAGAACT 45 60' 

- - TGGAGACGCC CTTGTGACCT CCAAGATTTT CCATGCATTC GTCCATAATG 

AT - 

#GGCAATGG 462 0 

- - GCCCACGGGC GGCGGCCTGG GCGAAGATAT TTCTGGGATC ACTAACGTCA 

55 TA - 

#GTTGTGTT 4680 

- - CCAGGATGAG ATCGTCATAG GCCATTTTTA CAAAGCGCGG GCGGAGGGTG 

CC - 

#AGACTGCG 47 40 

60 - ~ GTATAATGGT TCC ATCCGGC CCAGGGGCGT AGTTACCCTC ACAGATTTGC 

AT - 

#TTCCCACG 48 00 
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■ - _ CTTTGAGTTC AGATGGGGGG ATCATGTCTA CCTGCGGGGC GATGAAGAAA AC - 
#GGTTTCCG 4 8.60 

• - - GGGTAGGGGA GATCAGCTGG GAAGAAAGCA GGTTCCTGAG CAGCTGCGAC 
5 TT - • 

#ACCGCAGC 492 0 

- - CGGTGGGCCC GTAAATCACA CCTATTACCG GGTGCAACTG GTAGTTAAGA 

GA - 

10 #GCTGCAGC 49 8 0 

- - TGCCGTGATC CCTGAGCAGG GGGGCCACTT CGTTAAGCAT GTCCCTGACT 
CG - • 

#CATGTTTT 5040 

- - CCCTGACCAA ATCCGCCAGA AGGCGCTCGC CGCCCAGCGA TAGCAGTTCT 

15 TG - 

#CAAGGAAG 5100 

- - CAAAGTTTTT CAACGGTTTG AGACCGTCCG CCGTAGGCAT GCTTTTGAGC 

GT - 

#TTGACCAA 5160 

20 - - GCAGTTCCAG GCGGTCCCAC AGCTCGGTCA CCTGCTCTAC GGCATCTCGA 

TC - 

#CAGCATAT 5220 

- - CTCCTCGTTT CGCGGGTTGG GGCGGCTTTC GCTGTACGGC AGTAGTCGGT 

GC - 

25 #TCGTCCAG 52 80 

- - ACGGGCCAGG GTCATGTCTT TCCACGGGCG CAGGGTCCTC GTCAGCGTAG 

TC - 

#TGGGTCAC 53 40 

- - GGTGAAGGGG TGCGCTCCGG GCTGCGCGCT GGCCAGGGTG CGCTTGAGGC 
30 TG - . 

#GTCCTGCT 5 400 

- • - GGTGCTGAAG CGCTGCCGGT CTTCGCCCTG CGCGTCGGCC AGGTAGCATT 

TG - 

#ACCATGGT 54 60 

35 - - GTCATAGTCC AGCCCCTCCG CGGCGTGGCC CTTGGCGCGC AGCTTGCCCT 

TG - 

#GAGGAGGC 5520 

- - GCCGCACGAG GGGCAGTGCA GACTTTTGAG GGCGTAGAGC TTGGGCGCGA 

GA - - 

40 . #AATACCGA '5580 

: - \ " - - TTCCGGGGAG TAGGCATCCG CGCCGCAGGC CCCGCAGACG GTCTCGCATT 

CC - . 

#ACGAGCCA 5 640 

- - GGTGAGCTCT GGCCGTTCGG GGTCAAAAAC CAGGTTTCCC CCATGCTTTT 

45 TG - '• 

#ATGCGTTT 5700 

- - CTTACCTCTG GTTTCCATGA GCCGGTGTCC ACGCTCGGTG ACGAAAAGGC 

TG - 

#TCCGTGTC 57 60 

50 - - CCCGTATACA GACTTGAGAG GCCTGTCCTC GAGCGGTGTT CCGCGGTCCT 

CC - 

#TCGTATAG 5 82 0 

- - AAACTCGGAC CACTCTGAGA CAAAGGCTCG CGTCCAGGCC AGCACGAAGG 

AG - 

55 #GCTAAGTG 5 880 

- - GGAGGGGTAG CGGTCGTTGT CCACTAGGGG GTCCACTCGC* TCCAGGGTGT 

GA - 

# AG AC AC AT 5 940 

- - GTCGCCCTCT TCGGCATCAA GGAAGGTGAT TGGTTTGTAG GTGTAGGCCA 

60 CG - 

#TGACCGGG 600 0 
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- - TGTTCCTGAA GGGGGGCTAT AAAAGGGGGT GGGGGCGCGT TCGTCCTCAC TC - 
#TCTTCCGC 6060 

- - ATCGCTGTCT GCGAGGGCCA GCTGTTGGGG TGAGTACTCC CTCTGAAAAG 

5 CG - . 

#GGCATGAC 6120 

- - TTCTGCGCTA AGATTGTCAG TTTCCAAAAA CGAGGAGGAT TTGATATTCA 

CC - 

#TGGCCCGC 618 0 

10 - - GGTGATGCCT TTGAGGGTGG CCGCATCCAT CTGGTCAGAA AAGACAATCT 

TT - 

#TTGTTGTC 6240 

- - AAGCTTGGTG* GCAAACGACC CGTAGAGGGC GTTGGACAGC AACTTGGCGA 

TG - 

15 #GAGCGCAG 63 00 

- - GGTTTGGTTT TTGTCGCGAT CGGCGCGCTC CTTGGCCGCG ATGTTTAGCT 

GC - 

#ACGTATTC 6360 

- - GCGCGCAACG CACCGCCATT CGGGAAAGAC GGTGGTGCGC TCGTCGGGCA 
20 CC- . ' - 

#AGGTGCAC 6420 

- - GCGCCAACCG CGGTTGTGCA GGGTGACAAG GTCAACGCTG GTGGCTACCT 

CT - 

#CCGCGTAG 6480 

25 - - GCGCTCGTTG GTCCAGCAGA GGCGGCCGCC CTTGCGCGAG CAGAATGGCG 

GT - 

#AGGGGGTC 6540 

- - TAGCTGCGTC TCGTCCGGGG GGTCTGCGTC CACGGTAAAG ACCCCGGGCA 

GC - 

30 #AGGCGCGC 6600 

- - GTCGAAGTAG TCTATCTTGC ATCCTTGCAA GTCTAGCGCC TGCTGCCATG 

CG - 

#CGGGCGGC 6660 

- - AAGCGCGCGC TCGTATGGGT TGAGTGGGGG ACCCCATGGC ATGGGGTGGG 

35 TG - 

#AGCGCGGA 6720 

- - GGCGTACATG CCGCAAATGT C GT AAACGTA GAGGGGCTCT CTGAGTATTC 

CA - . 

#AGATATGT 6780 

40 - - AGGGTAGCAT CTTCCACCGC GGATGCTGGC GCGCACGTAA TCGTATAGTT 

CG - 

. #TGCGAGGG 6840 

- - AGCGAGGAGG TCGGGACCGA GGTTGCTACG GGCGGGCTGC TCTGCTCGGA 
AG - . ' 

45 #ACTATCTG 6900 

- - CCTGAAGATG GCATGTGAGT TGGATGATAT GGTTGGACGC TGGAAGACGT 

TG - 

#AAGCTGGC 69 60 

- - GTCTGTGAGA CCTACCGCGT CACGCACGAA GGAGGCGTAG GAGTCGCGCA 
" 50 GC - * 

#TTGTTGAC 7 02 0 

- - CAGCTCGGCG GTGACCTGCA CGTCTAGGGC GCAGTAGTCC AGGGTTTCCT 

TG - 

#ATGATGTC 7 080 

55 - - ATACTTATCC TGTCCCTTTT TTTTCCACAG CTCGCGGTTG AGGACAAACT 

CT - 

#TCGCGGTC 7140 

- - TTTCCAGTAC TCTTGGATCG GAAACCCGTC GGCCTCCGAA CGGTAAGAGC 

CT - 

60 # AGCATGTA 7200 

- - G AAC TGGTTG ACGGCCTGGT AGGCGCAGCA TCCCTTTTCT ACGGGTAGCG 

CG - 
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4TATGCCTG 72 60 

- - CGCGGCCTTC CGGAGCGAGG TGTGGGTGAG CGCAAAGGTG TCCCTGACCA 

5 TG - 

#ACTTTGAG 732 0 

- - GTACTGGTAT TTGAAGTCAG TGTCGTCGCA TCCGCCCTGC TCCCAGAGCA 

AA - 

#AAGTCCGT 73 8 0 

10 - - GCGCTTTTTG GAACGCGGAT TTGGCAGGGC GAAGGTGACA TCGTTGAAGA 

GT - 

#ATCTTTCC 7440 

- - CGCGCGAGGC ATAAAGTTGC GTGTGATGCG GAAGGGTCCC GGCACCTCGG 

AA - 

15 #CGGTTGTT * 750 0 

- - AATTACCTGG GCGGCGAGCA CGATCTCGTC AAAGCCGTTG ATGTTGTGGC 

CC - 

#ACAATGTA 7560 

- - AAGTTCCAAG AAGCGCGGGA TGCCCTTGAT GGAAGGCAAT TTTTTAAGTT 

20 CC - 

#TCGTAGGT 7 62 0 

- - GAGCTCTTCA GGGGAGCTGA GCCCGTGCTC. TGAAAGGGCC CAGTCTGCAA 

GA - 

#TGAGGGTT 7680' 

25 - - GGAAGCGACG AATGAGCTCC ACAGGTCACG GGCCATTAGC ATTTGCAGGT 

GG - * 
#TCGCGAAA 7740 

- - GGTCCTAAAC TGGCGACCTA TGGCCATTTT TTCTGGGGTG ATGCAGTAGA 
AG - " 

30 #GTAAGCGG 7800 . 

- - GTCTTGTTCC CAGCGGTCCC ATCCAAGGTT CGCGGCTAGG TCTCGCGCGG 

CA - 

#GTCACTAG 78 60 

- - AGGCTCATCT CCGCCGAACT TCATGACCAG CATGAAGGGC ACGAGCTGCT 

35 TC - 

#CCAAAGGC 792 0 

- - CCCCATCCAA GTATAGGTCT CTACATCGTA GGTGACAAAG AGACGCTCGG 
TG - ' 

#CGAGGATG 79 80 

40 - - CGAGCCGATC GGGAAGAACT GGATCTCCCG C C AC CAATTG GAGGAGTGGC 

>. TA - ' 
y #TTGATGTG. '8040 

- - GTGAAAGTAG AAGTCCCTGC GACGGGCCGA ACACTCGTGC TGGCTTTTGT 

AA - 

45 #AAACGTGC 8100 

- - GCAGTACTGG CAGCGGTGCA CGGGCTGTAC ATCCTGCACG AGGTTGACCT 

GA - 

#CGACCGCG 8160 

- - CACAAGGAAG CAGAGTGGGA ATTTGAGCCC CTCGCCTGGC GGGTTTGGCT 

50 GG - 

#TGGTCTTC 822 0 

- - TACTTCGGCT GCTTGTCCTT GACCGTCTGG CTGCTCGAGG GGAGTTACGG 

TG - 

#GATCGGAC 82 8 0 

55 - - CACCACGCCG CGCGAGCCCA AAGTCCAGAT GTCCGCGCGC GGCGGTCGGA 

GC - 

#TTGATGAC 83 4 0 

- - AACATCGCGC AGATGGGAGC TGTCCATGGT CTGGAGCTCC CGCGGCGTCA 

GG - 

60 #TCAGGCGG 8400 

- - GAGCTCCTGC AGGTTTACCT CGCATAGACG GGTCAGGGCG CGGGCTAGAT 

CC - 
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#AGGTGATA 8460 

- - CCTAATTTCC AGGGGCTGGT TGGTGGCGGC GTCGATGGCT TGCAAGAGGC 

CG - 

#CATCCCCG 852 0 

- - CGGCGCGACT ACGGTACCGC GCGGCGGGCG GTGGGCCGCG GGGGTGTCCT 

TG - 

#GATGATGC 8580 

- - ATCTAAAAGC GGTGACGCGG GCGAGCCCCC GGAGGTAGGG GGGGCTCCGG 

AC - 

#CCGCCGGG 8640 

- - AGAGGGGGCA GGGGCACGTC GGCGCCGCGC GCGGGCAGGA GCTGGTGCTG 

CG - 

#CGCGTAGG 8700 

- TTGCTGGCGA ACGCGACGAC GCGGCGGTTG ATCTCCTGAA TCTGGCGCCT 

CT - 

#GCGTGAAG 87 60 

- - ACGACGGGCC CGGTGAGCTT GAGCCTGAAA GAGAGTTCGA CAGAATCAAT 

TT - 

#CGGTGTCG 8820 

- - TTGACGGCGG CCTGGCGCAA AATCTCCTGC ACGTCTCCTG AGTTGTCTTG 

AT - 

#AGGCGATC 8880 

- - TCGGCGATGA ACTGCTCGAT CTCTTCCTCC TGGAGATCTC CGCGTCCGGC 

TC - 

#GCTCCACG 8940 

- - GTGGCGGCGA GGTCGTTGGA AATGCGGGCC ATGAGCTGCG AGAAGGCGTT 

GA - 

#GGCCTCCC 9 000 

- - TCGTTCCAGA CGCGGCTGTA GACCACGCCC CCTTCGGCAT CGCGGGCGCG' 

CA - 

#TGACCACC 9060 

- - TGCGCGAGAT TGAGCTCCAC GTGCCGGGCG AAGACGGCGT AGTTTC GC AG 

GC - 

. #GCTGAAAG 912 0 

- - AGGTAGTTGA GGGTGGTGGC GGTGTGTTCT GCC AC GAAGA AGTACATAAC 

CC - 

#AGCGTCGC 9180 

- - AACGTGGATT CGTTGATATC CCCCAAGGCC TCAAGGCGCT CCATGGCCTC 
GT - ■ 

#AGAAGTCC 9240 

- - ACGGCGAAGT TGAAAAACTG GGAGTTGCGC GCCGACACGG TTAACTCCTC 

CT - 

#CCAGAAGA " 93 00 

- - CGGATGAGCT CGGCGACAGT GTCGCGCACC TCGCGCTCAA AGGCTACAGG 

GG - 

#CCTCTTCT 93 60 

- - TCTTCTTCAA TCTCCTCTTC CATAAGGGCC TCCCCTTCTT- CTTCTTCTGG 
CG - ' • 

#GCGGTGGG 942 0 

- - GGAGGGGGGA CACGGCGGCG ACGACGGCGC ACCGGGAGGC GGTCGACAAA 

GC - 

#GCTCGATC 9480 

- - ATCTCCCCGC GGCGACGGCG CATGGTCTCG GTGACGGCGC GGCCGTTCTC 

GC - 

#GGGGGCGC 9540 

- - AGTTGGAAGA CGCCGCCCGT CATGTCCCGG TTATGGGTTG GCGGGGGGCT 

GC - 

#CATGCGGC 9 600 

- - AGGGATACGG CGCTAACGAT GCATCTCAAC AATTGTTGTG TAGGTACTCC 

GC - 

#CGCCGAGG 9 660 
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- - GACCTGAGCG AGTCCGCATC GACCGGATCG GAAAACCTCT CGAGAAAGGC GT - 
#CTAACCAG 9720 * . 

- - TCACAGTCGC AAGGTAGGCT GAGCACCGTG GCGGGCGGCA GCGGGCGGCG 

GT - 

SCGGGGTTG 97 8 0 

- - TTTCTGGCGG AGGTGCTGCT GATGATGTAA TTAAAGTAGG CGGTCTTGAG 

AC - 

#GGCGGATG . 9 840 

- - GTCGACAGAA GCACCATGTC CTTGGGTCCG GCCTGCTGAA TGC GCAGGCG 

GT - 

#CGGCCATG .9900 \ 

- - CCCCAGGCTT CGTTTTGACA TCGGCGCAGG TCTTTGTAGT AGTCTTGCAT 

GA - 

#GCCTTTCT 99 60 

- - AC C GGCACTT CTTCTTCTCC . TTCCTCTTGT CCTGCATCTC TTGCATCTAT 

CG - 

tCTGCGGCG 10020 

- - GCGGCGGAGT TTGGCCGTAG GTGGCGCCCT CTTCCTCCCA TGCGTGTGAC 

CC - 

#CGAAGCCC 100 80 

- - CTCATCGGCT GAAGCAGGGC TAGGTCGGCG ACAACGCGCT CGGCTAATAT 

GG - 

#CCTGCTGC 10140 

- - ACCTGCGTGA GGGTAGACTG GAAGTCATCC ATGTCCACAA AGC GGTGGTA 

TG - 

#CGCCCGTG 10200 

- - TTGATGGTGT AAGTGCAGTT GGCCATAACG GACCAGTTAA CGGTCTGGTG 

AC - 

. #CCGGCTGC 10260 

- - GAGAGCTCGG TGTACCTGAG ACGCGAGTAA GCCCTCGAGT CAAATACGTA 

GT - 

#CGTTGCAA 103 2 0 

- - GTCCGCACCA GGTACTGGTA TCCCACCAAA AAGTGCGGCG GCGGCTGGCG 

GT - 

fAGAGGGGC 103 80 * ' 

- - CAGCGTAGGG TGGCCGGGGC TCCGGGGGCG AGATCTTCCA ACATAAGGCG 

AT - 

#GATATCCG - 10440 

- - T AGATGT AC C TGGACATCCA GGTGATGCCG GCGGCGGTGG TGGAGGCGCG 

CG - 

#GAAAGTCG 10500 

- - CGGACGCGGT TCCAGATGTT ' GCGCAGCGGC AAAAAGTGCT CCATGGTCGG 

GA - 

#CGCTCTGG 105 60 

- - CCGGTCAGGC GCGCGCAATC GTTGACGCTC TAGACCGTGC AAAAGGAGAG ' 

CC - 

#TGTAAGCG 10620 

- - GGCACTCTTC CGTGGTCTGG TGGATAAATT CGCAAGGGTA TCATGGCGGA 

CG - 

#ACCGGGGT 10680 

- - TCGAGCCCCG TATCCGGCCG TCCGCCGTGA TCCATGCGGT TACCGCCCGC 

GT - 

#GTCGAACC 10740 

- - CAGGTGTGCG ACGTCAGACA ACGGGGGAGT GCTCCTTTTG GCTTCCTTCC 

AG - 

#GCGCGGCG 10800 

- - GCTGCTGCGC TAGCTTTTTT GGCCACTGGC CGCGCGCAGC GTAAGCGGTT 

AG - 

#GCTGGAAA 10860 

- - GCGAAAGCAT TAAGTGGCTC GCTCCCTGTA GCCGGAGGGT TATTTTCCAA 

GG - 
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#GTTGAGTC 10920 

- - GCGGGACCCC CGGTTCGAGT CTCGGACCGG CCGGACTGCG GCGAACGGGG 

GT - 

5 #TTGCCTCC 10980 

- - CCGTCATGCA AGACCCCGCT TGCAAATTCC TCCGGAAACA GGGACGAGCC 

CC - 

#TTTTTTGC '11040 

- - TTTTCCCAGA TGCATCCGGT GCTGCGGCAG ATGCGCCCCC CTCCTCAGCA 

10 GC - 

#GGCAAGAG 11100 

- - CAAGAGCAGC GGCAGACATG CAGGGCACCC TCCCCTCCTC CTACCGCGTC 
AG - • % 

§GAGGGGCG -11160 

15 - - ACATCCGCGG TTGACGCGGC AGCAGATGGT GATTACGAAC CCCCGCGGCG 

CC - 

#GGGCCCGG 11220 

- - CACTACCTGG ACTTGGAGGA GGGCGAGGGC CTGGCGCGGC TAGGAGCGCC 

CT - 

;20 #CTCCTGAG 11280 

- - CGGTACCCAA GGGTGCAGCT GAAGCGTGAT ACGCGTGAGG CGTACGTGCC 

GC - ' 

#GGCAGAAC 11340 

- - CTGTTTCGCG ACCGCGAGGG AGAGGAGCCC GAGGAGATGC GGGATCGAAA 

25 GT - 

#TCCACGCA 11400 

- - GGGCGCGAGC TGCGGCATGG CCTGAATCGC GAGCGGTTGC TGCGCGAGGA 

GG - 

#ACTTTGAG 11460 

30 - - CCCGACGCGC GAACCGGGAT TAGTCCCGCG CGCGCACACG TGGCGGCCGC 

CG - 

#ACCTGGTA 11520 

- - ACCGCATACG AGCAGACGGT GAACCAGGAG ATTAACTTTC AAAAAAGCTT 

TA - 

35 #ACAACCAC 115 80 

- - GTGCGTACGC TTGTGGCGCG CGAGGAGGTG GCTATAGGAC TGATGCATCT 

GT - 

#GGGACTTT .11640 

- - GTAAGCGCGC TGGAGCAAAA CCCAAATAGC AAGCCGCTCA TGGCGCAGCT 

40 GT - 

#TCCTTATA 117 00 

: " • - - GTGCAGCACA GCAGGGACAA CGAGGCATTC AGGGATGCGC TGCTAAA.CAT 

AG - 

#TAGAGCCC 117 60 

45 - - GAGGGCCGCT GGCTGCTCGA TTTGATAAAC ATCCTGCAGA GCATAGTGGT 

GC - 

• #AGGAGCGC 1182 0 

- - AGCTTGAGCC TGGCTGACAA GGTGGCCGCC ATCAACTATT CCATGCTTAG 

CC ■ 

50 #TGGGCAAG 118 8 0 

- - TTTTACGCCC GCAAGATATA CCATACCCCT TACGTTCCCA TAGACAAGGA 

GG - 

#TAAAGATC 11940 

- - GAGGGGTTCT ACATGCGCAT GGCGCTGAAG GTGCTTACCT TGAGCGACGA 

55 CC - 

ttTGGGCGTT 12 000 

- - TATCGCAACG AGCGCATCCA CAAGGCCGTG AGCGTGAGCC GGCGGCGCGA 
GC - , 

#TCAGCGAC 12 060 

60 - - CGCGAGCTGA TGCACAGCCT GCAAAGGGCC CTGGCTGGCA CGGGCAGCGG 

CG - 

#ATAGAGAG 1212 0 
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- - GCCGAGTCCT ACTTTGACGC GGGCGCTGAC CTGCGCTGGG CCCCAAGCCG AC - 
#GCGCCCTG .12180 

- - GAGGCAGCTG GGGCCGGACC TGGGCTGGCG GTGGCACCCG CGCGCGCTGG 

5 CA - 

#ACGTCGGC 1224 0 

- - GGCGTGGAGG AATATGACGA GGACGATGAG TACGAGCCAG AGGACGGCGA 

GT - 

#ACTAAGCG 123 00 

10 - - GTGATGTTTC TGATCAGATG ATGCAAGACG CAACGGACCC GGCGGTGCGG 

GC - 

#GGCGCTGC 123 60 

- - AGAGCCAGCC GTCCGGCCTT* AACTCCACGG ACGACTGGCG CCAGGTCATG 
GA - • 

15 #CCGCATCA 1242 0 

- - TGTCGCTGAC TGCGCGCAAT CCTGACGCGT TCCGGCAGCA GCCGCAGGCC 

AA - 

tCCGGCTCT 12480 

- - - CCGCAATTCT GGAAGCGGTG GTCCCGGCGC GCGCAAACCC CACGCACGAG 

.20 AA - 

SGGTGCTGG 12 540 

- - CGATCGTAAA CGCGCTGGCC GAAAACAGGG - CCATCCGGCC CGACGAGGCC 

GG - 

SCCTGGTCT 12 600 

25 

- - ACGACGCGCT GCTTCAGCGC GTGGCTCGTT ACAACAGCGG CAACGTGCAG 

AC - 

#CAACCTGG 12 660 

- ACCGGCTGGT GGGGGATGTG CGCGAGGCCG TGGCGCAGCG TGAGCGCGCG 

30 CA - 

#GCAGCAGG 1272 0 

- - GCAACCTGGG CTCCATGGTT ■ GCACTAAACG CCTTCCTGAG TACACAGCCC 

GC - 

#CAACGTGC 1278 0 

35 - - CGCGGGGACA GGAGGACTAC ACCAACTTTG TGAGCGCACT GCGGCTAATG 

GT - 

#GACTGAGA 12 840 

- - CACCGCAAAG TGAGGTGTAC CAGTCTGGGC CAGACTATTT TTTCCAGACC 

AG - 

40 #TAGACAAG 12900 

' - - GCCTGCAGAC CGTAAACCTG AGCCAGGCTT TCAAAAACTT GCAGGGGCTG 

TG - 

#GGGGGTGC 12 960 

- - GGGCTCCCAC AGGCGACCGC GCGACCGTGT CTAGCTTGCT GACGCCCAAC 

45 TC - 

#GCGCCTGT 13 02 0 

- - TGCTGCTGCT AATAGCGCCC TTCACGGACA GTGGCAGCGT GTCCCGGGAC 

AC - 

#ATACCTAG 13 080 

50 - - ' GTCACTTGCT GACACTGTAC CGCGAGGCCA TAGGTCAGGC GCATGTGGAC 

GA - 

#GCATACTT 13140 

- - TCCAGGAGAT TACAAGTGTC AGCCGCGCGC TGGGGCAGGA GGACACGGGC 

AG - 

55 #CCTGGAGG 132 00 

- - C AAC C CT AAA CTACCTGCTG ACCAACCGGC GGCAGAAGAT CCCCTCGTTG 

CA - 

#CAGTTTAA 13 2 60 

- - ACAGCGAGGA GGAGCGCATT TTGCGCTACG TGCAGCAGAG CGTGAGCCTT 

60 AA - 

#CCTGATGC 13 32 0 
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- GCGACGGGGT AACGCCCAGC GTGGCGCTGG ACATGACCGC GCGCAACATG GA - 
#ACCGGGCA 133 8 0 

- - TGTATGCCTC AAACCGGCCG TTTATCAACC GCCTAATGGA CTACTTGCAT 



CG - 
CC - 
TG - 



#CGCGGCCG 13440 m 

- - CCGTGAACCC CGAGTATTTC ACCAATGCCA TCTTGAACCC GCACTGGCTA 

#GCCCCCTG 1350 0 

10 - - GTTTCTACAC CGGGGGATTC GAGGTGCCCG AGGGTAACGA TGGATTCCTC 



#GGACGACA 13560 

- - TAGACGACAG CGTGTTTTCC CCGCAACCGC AGACCCTGCT AGAGTTGCAA 

CA - 

15 #GCGCGAGC 13 62 0 

- - AGGCAGAGGC GGCGCTGCGA AAGGAAAGCT TCCGCAGGCC AAGCAGCTTG 



TC - 
20 GG - 
AA - 
GG - 



#CGATCTAG 13 68 0 * 

- - GCGCTGCGGC CCCGCGGTCA GATGCTAGTA GCCCATTTCC AAGCTTGATA 

#GTCTCTTA 13740 

- - CCAGCACTCG CACCACCCGC CCGCGCCTGC' " TGGGCGAGGA GGAGTACCTA 

#CAACTCGC 13 8 00* 

25 - - TGCTGCAGCC GCAGCGCGAA AAAAACCTGC CTCCGGCATT TCCCAACAAC 



#GATAGAGA 13 860 

- - GCCTAGTGGA CAAGATGAGT AGATGGAAGA CGTACGCGCA GGAGCACAGG 

GA - 

30 #CGTGCCAG 13 92 0 

- - GCCCGCGCCC GCCCACCCGT CGTCAAAGGC ACGACCGTCA GCGGGGTCTG 



• GT - 



35 AA - 



#GTGGGAGG 13980 

- - ACGATGACTC GGCAGACGAC AGCAGCGTCC TGGATTTGGG AGGGAGTGGC 

#CCCGTTTG 14 040 

- - CGCACCTTCG CCCCAGGCTG GGGAGAATGT TTTAAAAAAA AAAAAGCATG 

AT - 

#GCAAAATA 14100 

40 - - AAAAACTCAC CAAGGCCATG GCACCGAGCG TTGGTTTTCT TGTATTCCCC 

TT - 

#AGTATGCG 14160 

- - GCGCGCGGCG ATGTATGAGG . AAGGTCCTCC TCCCTCCTAC GAGAGTGTGG 
TG - . 

45 #AGCGCGGC 1422 0 

- - GCCAGTGGCG GCGGCGCTGG GTTCTCCCTT CGATGCTCCC " CTGGACCCGC 

CG .- 

#TTTGTGCC 142 8 0 

- - TCCGCGGTAC CTGCGGCCTA CCGGGGGGAG AAACAGCATC CGTTACTCTG 

50 AG - 

#TTGGCACC 1434 0 

- - C CTATTC G AC ACCACCCGTG TGTACCTGGT GGACAACAAG TCAACGGATG 

TG - 

#GCATCCCT 14400 

55 - - GAACTAC C AG AACGACCACA GCAACTTTCT- GACCACGGTC ATTCAAAACA 

AT - 

#G ACT AC AG 144 6 0 

- - CCCGGGGGAG GCAAGCACAC AGACCATCAA TCTTGACGAC CGGTCGCACT 

GG - 

60 #GGCGGCGA 14520 

- - CCTGAAAAC C ATCCTGCATA * CCAACATGCC AAATGTGAAC GAGTTCATGT 

TT - 
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#ACCAATAA 145 80 

- - GTTTAAGGCG CGGGTGATGG TGTCGCGCTT GCCTACTAAG GACAATCAGG 

TG - 

5 #GAGCTGAA 14640 

- - ATACGAGTGG GTGGAGTTGA CGCTGCCCGA GGGCAACTAC TCCGAGACCA 

TG - 

#ACCATAGA 147 00 

- - CCTTATGAAC AACGCGATCG TGGAGCACTA CTTGAAAGTG GGCAGACAGA 
10 AC - ' " 

&GGGGTTCT 14760 

- - GGAAAGCGAC ATCGGGGTAA AGTTTGACAC CCGCAACTTC AGACTGGGGT 

TT* - 

#GACCCCGT 14820 

15 . . - - CACTGGTCTT GTCATGCCTG GGGTATATAC AAACGAAGCC TTCCATCCAG 

AC - 

#ATCATTTT 14880 

- - GCTGCCAGGA TGCGGGGTGG ACTTCACCCA CAGCCGCCTG AGCAACTTGT 

TG - 

.20 tGGCATCCG 14940 

- - CAAGCGGCAA CCCTTCCAGG AGGGCTTTAG GATCACCTAC GATGATCTGG 

AG - 

#GGTGGTAA 15000 

- - CATTCCCGCA CTGTTGGATG TGGACGCCTA CCAGGCGAGC TTGAAAGATG 

.25 AC - 

#ACCGAACA 15060 

- - GGGCGGGGGT GGCGCAGGCG GCAGCAACAG CAGTGGCAGC GGCGCGGAAG 

AG - 

#AACTCCAA 1512 0 

30 - - CGCGGCAGCC GCGGCAATGC AGCCGGTGGA GGACATGAAC GATCATGCCA 

TT - 

#CGCGGCGA 1518 0 

- - CACCTTTGCC ACACGGGCTG • AGGAGAAGCG CGCTGAGGCC GAAGCAGCGG 

CC - 

35 #GAAGCTGC 1524 0 

- - CGCCCCCGCT GCGCAACCCG AGGTCGAGAA GCCTCAGAAG AAACCGGTGA 

TC - • 

#AAACCCCT 153 00' 

- - GACAGAGGAC AGCAAGAAAC GCAGTTACAA CCTAATAAGC AATGACAGCA 

40 CC - 
: v? #TTCACCCA 15360 

- - GTACCGCAGC TGGTACCTTG CATACAACTA CGGCGACCCT CAGACCGGAA 

TC - 

tCGCTCATG 1542 0 

45 - - GACCCTGCTT* TGCACTCCTG ACGTAACCTG CGGCTCGGAG CAGGTCTACT 

GG - " 
#TCGTTGCC 15480 

- - AGACATGATG CAAGACCCCG TGACCTTCCG CTCCACGCGC CAGATCAGCA 

AC - 

50 #TTTCCGGT 15540 

- - GGTGGGCGCC ' GAGCTGTTGC CCGTGCACTC CAAGAGCTTC TACAACGACC 

AG - 

#GCCGTCTA 15600 

. - - ■ CTCCCAACTC ATCCGCCAGT TTACCTCTCT GACCCACGTG TTCAATCGCT 

55 TT - 

#CCCGAGAA 15 660 

- - CCAGATTTTG GCGCGCCCGC CAGCCCCCAC CATCACCACC GTCAGTGAAA 

AC - 

#GTTCCTGC 15720 

60 - - TCTCACAGAT CACGGGACGC TACCGCTGCG CAACAGCATC GGAGGAGTCC 

AG - 

#CGAGTGAC 15780 
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- - CATTACTGAC GCCAGACGCC GCACCTGCCC CTACGTTTAC AAGGCCCTGG GC - 
#ATAGTCTC 15 84 0 

- - GCCGCGCGTC CTATCGAGCC GCACTTTTTG AGCAAGCATG TCCATCCTTA 

5 TA - 

#TCGCCCAG 15900 

- - CAATAACACA GGCTGGGGCC TGCGCTTCCC AAGCAAGATG TTTGGCGGGG 

CC - 

#AAGAAGCG 159 60 

10 - - CTCCGACCAA CACCCAGTGC GCGTGCGCGG GCACTACCGC GCGCCCTGGG 

GC - 

#GCGCACAA 16020 

- - ACGCGGCCGC ACTGGGCGCA CCACCGTCGA TGACGCCATC GACGCGGTGG 

TG - ' 

15 #GAGGAGGC 16080 

- GCGCAACTAC ACGCCCACGC CGCCACCAGT GTCCACAGTG GACGCGGCCA 

TT - 

#CAGACCGT 16140 

- - GGTGCGCGGA GCCCGGCGCT ATGCTAAAAT GAAGAGACGG CGGAGGCGCG 

20 TA - 

#GCACGTCG 16200 

- - CCACCGCCGC CGACCCGGCA CTGCCGCCCA ACGCGCGGCG GCGGCCCTGC 

TT - 

#AACCGCGC 16260 

25 - - ACGTCGCACC GGCCGACGGG CGGCCATGCG GGCCGCTCGA AGGCTGGCCG 

CG - 

#GGTATTGT 1632 0 

' - - CACTGTGCCC CCCAGGTCCA GGCG AC GAGC GGCCGCCGCA GCAGCCGCGG 

CC - 

30 #ATTAGTGC 16380 

- - TATGACTCAG GGTCGCAGGG GCAACGTGTA TTGGGTGCGC GACTCGGTTA 

GC - 

#GGCCTGCG 16440 

- - CGTGCCCGTG CGCACCGGCC CCCCGCGCAA CTAGATTGCA AGAAAAAACT 

35 AC - 

#TTAGACTC 16500 

- - GTACTGTTGT ATGTATCCAG CGGCGGCGGC GCGCAACGAA GCTATGTCCA 

AG ' - 

#CGCAAAAT 16560 * - 

40 - - CAAAGAAGAG ATGCTCCAGG TCATCGCGCC GGAGATCTAT GGCCCCCCGA 

AG - 

fAAGGAAGA 16620 

- - GCAGGATTAC AAGCCCCGAA AGCTAAAGCG GGTCAAAAAG AAAAAGAAAG 

AT - 

45 #GATGATGA 16680 

- - TGAACTTGAC GACGAGGTGG AACTGCTGCA CGCTACCGCG CCCAGGCGAC 

GG - 

#GTACAGTG 16740 

- - GAAAGGTCGA CGCGTAAAAC GTGTTTTGCG ACCCGGCACC ACCGTAGTCT 

50 TT - 

#ACGCCCGG 16800 

- - TGAGCGCTCC ACCCGCACCT ACAAGCGCGT GTATGATGAG GTGTACGGCG 

AC - 

#GAGGACCT 168 60 

55 - - GCTTGAGCAG GCCAACGAGC GCCTCGGGGA GTTTGCCTAC GGAAAGCGGC 

AT - 

# AAGGAC AT 1692 0 

- - GCTGGCGTTG CCGCTGGACG AGGGCAACCC AAC AC C TAG C CTAAAGCCCG 

TA - 

60 . &ACACTGCA 16980 

- GCAGGTGCTG CCCGCGCTTG CACCGTCCGA AGAAAAGCGC GGCCTAAAGC 

GC - " 
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#GAGTCTGG 17040 

- - TGACTTGGCA CCCACCGTGC AGCTGATGGT ACCCAAGCGC CAGCGACTGG 

AA - 

5 #GATGTCTT 17100 

- - GGAAAAAATG ACCGTGGAAC CTGGGCTGGA GCCCGAGGTC CGCGTGCGGC 

CA - 

#ATCAAGCA 17160 

- - GGTGGCGCCG GGACTGGGCG TGCAGACCGT GGACGTTCAG ATACCCACTA 

10 cc - 

#AGTAGCAC 17220 

- - CAGTATTGCC ACCGCCACAG AGGGCATGGA GACACAAACG TCCCCGGTTG 

CC - " 

#TCAGCGGT 172 80 

15 - - GGCGGATGCC GCGGTGCAGG CGGTCGCTGC GGCCGCGTCC AAGACCTCTA 

CG - 

#GAGGTGCA 17340 

- - AACGGAC CCG TGGATGTTTC GCGTTTCAGC CCCCCGGCGC CCGCGCGGTT 

CG - 

.20 # AGGAAGTA 17400 

- - CGGCGCCGCC AGCGCGCTAC TGCCCGAATA TGCCCTACAT CCTTCCATTG 

CG - 

#CCTACCCC- 17460 

- - CGGCTATCGT GGCTACACCT ACCGCCCCAG AAGACGAGCA ACTACCCGAC' 
25 GC - ' 

#CGAACCAC 17520 

- - CACTGGAACC CGCCGCCGCC GTCGCCGTCG CCAGCCCGTG CTGGCCCCGA 

TT - 

#TCCGTGCG 1758 0 

30 - - CAGGGTGGCT CGCGAAGGAG GCAGGACCCT GGTGCTGCCA ACAGCGCGCT 

AC - 

#CACCCCAG 17 640 

- - CATC GTTTAA AAGCCGGTCT TTGTGGTTCT TGCAGATATG GCCCTCACCT 

GC - 

35 #CGCCTCCG 17700 . . 

- - TTTCCCGGTG CCGGGATTCC GAGGAAGAAT GCACCGTAGG AGGGGCATGG 

CC - 

#GGCCACGG 177 60 

- - CCTGACGGGC GGCATGCGTC GTGCGCACCA CCGGCGGCGG CGCGCGTCGC 

40 AC - 

* tCGTCGCAT 17 82 0 

- - GCGCGGCGGT ATCCTGCCCC TCCTTATTCC ACTGATCGCC GCGGCGATTG 

GC - 

#GCCGTGCC 1788 0 

45 - - CGGAATTGCA TCCGTGGCCT TGCAGGCGCA GAGACACTGA TTAAAAACAA 

GT - 

#TGCATGTG 17940 

- - GAAAAATCAA AATAAAAAGT CTGGACTCTC ACGCTCGCTT GGTCCTGTAA 

CT - 

50 ttATTTTGTA 18000 

- - GAATGGAAGA CATCAACTTT GCGTCTCTGG CCCCGCGACA CGGCTCGCGC 

CC - 

#GTTCATGG 1806 0 

- - GAAACTGGCA AGATATCGGC ACCAGCAATA TGAGCGGTGG CGCCTTCAGC 
55 TG - . 

#GGGCTCGC 1812 0 

- - TGTGGAGCGG CATTAAAAAT TTCGGTTCCA CCGTTAAGAA CTATGGCAGC 

AA - 

#GGCCTGGA 1818 0 

60 - - ACAGCAGCAC AGGCCAGATG CTGAGGGATA AGTTGAAAGA GCAAAATTTC 

CA - 

#ACAAAAGG 18240 
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- - TGGTAGATGG CCTGGCCTCT GGCATTAGCG GGGTGGTGGA CCTGGCCAAC CA - 
#GGCAGTGC 18300 

- - AAAATAAGAT TAACAGTAAG CTTGATCCCC GCCCTCCCGT AGAGGAGCCT 
5 CC - ■ 

#ACCGGCCG 183 60 ... 

- - TGGAGACAGT GTCTCCAGAG GGGCGTGGCG AAAAGCGTCC GCGCCCCGAC 

AG - 

#GGAAGAAA 1842 0 

10 - - CTCTGGTGAC GCAAATAGAC GAGCCTCCCT CGTACGAGGA GGC^CTAAAG 

CA - 

#AGGCCTGC 18480 

- - CCACCACCCG TCCCATCGCG CCCATGGCTA CCGGAGTGCT GGGCCAGCAC 

AC - 

15 #ACCCGTAA 18540 

- - CGCTGGACCT GCCTCCCCCG GCCGACACCC AGCAGAAACC TGTGCTGCCA 

GG - 

#CCCGACCG 18600 

- - CCGTTGTTGT AACCCGTCCT AGCCGCGCGT CCCTGCGCCG CGCCGCCAGC ■ 

."20 GG - 

#TCCGCGAT 18660 

- - CGTTGCGGCC CGTAGCCAGT GGCAACTGGC AAAGCACACT GAACAGCATC 

GT - 

#GGGTCTGG 18720 

25 - - GGGTGCAATC CCTGAAGCGC CGACGATGCT TCTGAATAGC- TAACGTGTCG 

T A - ' 
#TGTGTGTC 18780 

- - ATGTATGCGT CCATGTCGCC GCCAGAGGAG CTGCTGAGCC GCCGCGCGCC 

CG - 

30 #CTTTCGAA ■ 18840 

- - GATGGCTACC CCTTCGATGA TGCCGCAGTG . GTCTTACATG CACATCTCGG 

GO - 

#CAGGACGC 18900 

- - CTCGGAGTAC CTGAGCCCCG GGCTGGTGCA GTTTGCCCGC GCCACCGAGA 

35 CG - 

#TACTTCAG 189 60 

- - CCTGAATAAC AAGTTTAGAA ACCCCACGGT GGCGCCTACG CACGACGTGA 

CC - 

• #ACAGACCG 19 020 
40 - - GTCCCAGCGT TTGACGCTGC GGTTCATCCC TGTGGACCGT GAGGATACTG 

CG - . 

#TACTCGTA 19080 

- - CAAGGCGCGG TTCACCCTAG CTGTGGGTGA TAACCGTGTG CTGGACATGG 

CT - 

45 #TCCACGTA 19140 

- - CTTTGACATC CGCGGCGTGC TGGACAGGGG CCCTACTTTT AAGCCCTACT 

CT - 

#GGCACTGC 192 00 

- - CTACAACGCC CTGGCTCCCA AGGGTGCCCC AAATCCTTGC GAATGGGATG 

50 AA - 

#GCTGCTAC 192 60 

- - TGCTCTTGAA ATAAACCTAG AAGAAGAGGA CGATGACAAC GAAGACGAAG 

TA - 

#GACGAGCA 1932 0 

55 - - AGCTGAGCAG CAAAAAACTC ACGTATTTGG GCAGGCGCCT TATTCTGGTA 

TA - 

# AAT ATTAC 193 8 0 

- - AAAGGAGGGT ATTCAAATAG GTGTCGAAGG TCAAACACCT AAATATGCCG 

AT - 

60 ttAAAACATT 19440 

- - TCAACCTGAA CCTCAAATAG GAGAATCTCA GTGGTACGAA ACTGAAATTA 

. AT - 
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#CATGCAGC 19500 

- - TGGGAGAGTC CTTAAAAAGA CTACCCCAAT GAAACCATGT TACGGTTCAT 

AT - 

5 #GCAAAACC 195 60 

- - CACAAATGAA AATGGAGGGC AAGGCATTCT TGTAAAGCAA CAAAATGGAA 

AG - 

#CTAGAAAG 19620 

- - TCAAGTGGAA ATGCAATTTT TCTCAACTAC TGAGGCGACC GCAGGCAATG 

10 GT - 

#GATAACTT 19 680 

- - GACTCCTAAA GTGGTATTGT ACAGTGAAGA TGTAGATATA GAAACCCCAG 

AC - 

#ACTCATAT 19740 

1*5 . - - TTCTTACATG CCCACTATTA AGGAAGGTAA CTCACGAGAA CTAATGGGCC 

AA - 

#CAATCTAT 19800 

- - GCCCAACAGG CCTAATTACA ■ TTGCTTTT AG GG AC AATTTT ATTGGTCTAA 

TG - 

■20 #TATTACAA 198 60 

- - CAGCACGGGT AATATGGGTG TTCTGGCGGG CCAAGCATCG CAGTTGAATG 

CT - 

tGTTGTAGA 1992 0 

- - TTTGCAAGAC AGAAACACAG AGCTTTCATA CCAGCTTTTG CTTGATTCCA 

25 TT - 

#GGTGATAG 19980 

- - AACCAGGTAC TTTTCTATGT GGAATCAGGC TGTTGACAGC TATGATCCAG 

AT - 

#GTTAGAAT 2 0040 

30 - - TATTGAAAAT CATGGAACTG AAGATGAACT TCCAAATTAC TGCTTTCCAC 

TG - 

#GGAGGTGT 2 0100 

- - GATTAATACA GAGACTCTTA CCAAGGTAAA ACCTAAAACA GGTCAGGAAA 

- AT - 

35 #GGATGGGA 2 0160 

- - AAAAGATGCT ACAGAATTTT CAGATAAAAA TGAAATAAGA GTTGGAAATA 

AT - 

#TTTGCCAT 2 0220 

- - GGAAATCAAT ' CTAAATGCCA ACCTGTGGAG AAATTTCCTG TACTCCAACA 

^ 40 TA - 

#GCGCTGTA 2 0280 

- - TTTGC C CG AC AAGCTAAAGT ACAGTCCTTC CAACGTAAAA ATTTCTGATA 

' AC - 

45 #CCAAACAC 2 0340 

- - CTACGACTAC ATGAACAAGC GAGTGGTGGC TCCCGGGTTA GTGGACTGCT 

AC - 

4ATTAACCT 2 0400 

- - TGGAGCACGC TGGTCCCTTG ACTATATGGA CAACGTCAAC CCATTTAACC 

50 AC - 

#CACCGCAA- 2 0460 

- - TGCTGGCCTG CGCTACCGCT CAATGTTGCT GGGCAATGGT CGCTATGTGC 

CC - 

STTCCACAT 2 052 0 

55 - - CCAGGTGCCT CAGAAGTTCT TTGCCATTAA AAACCTCCTT CTCCTGCCGG 

GC - 

# TCAT AC AC 2 058 0 

- - CTACGAGTGG AACTTCAGGA AGGATGTTAA CATGGTTCTG CAGAGCTCCC 

TA - 

60 #GGAAATGA 2 0 640 

. - - CCTAAGGGTT GACGGAGCCA GCATTAAGTT TGATAGCATT TGCCTTTACG 

CC - • 
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#ACCTTCTT 2 0700 

- - CCCCATGGCC CACAACACCG CCTCCACGC1 TGAGGCCATG CTTAGAAACG 

AC - 

5 # ACCAACGA ■ 2 07 60 

- - CCAGTCCTTT AACGACTATC TCTCCGCCGC CAACATGCTC TACCCTATAC 

CC - 

#GCCAACGC 2 0 820 

- - TACCAACGTG CCCATATCCA TCCCCTCCCG CAACTGGGCG GCTTTCCGCG 

10 GC - 

#TGGGCCTT 2 0880 

- - CACGCGCCTT AAGACTAAGG AAACCCCATC ACTGGGCTCG GGCTACGACC 

CT - 

#TATTACAC 2 0940 * 
15 - - CTACTCTGGC TCTATACCCT ACCTAGATGG AACCTTTTAC CTCAACCACA 

CC - 

#TTTAAGAA 2100 0 

- - GGTGGCCATT ACCTTTGACT CTTCTGTCAG CTGGCCTGGC AATGACCGCC 

TG - 

•20 #CTTACCCC- 21060 

• - - CAACGAGTTT . GAAATTAAGC GCTCAGTTGA CGGGGAGGGT TACAACGTTG 
CC - * 
#CAGTGTAA 2112 0 

- - C ATG AC C AAA GACTGGTTCC TGGTACAAAT GCTAGCTAAC TACAACATTG 

25 GC - 

.#TACCAGGG 2118 0 

- - CTTCTATATC CCAGAGAGCT ACAAGGACCG CATGTACTCC TTCTTTAGAA 

AC - 

#TTCCAGCC 21240 

30 - - CATGAGCCGT CAGGTGGTGG ATGATACTAA ATACAAGGAC TACCAACAGG 

TG - 

#GGCATCCT 213 00 

- - ACACCAACAC AACAACTCTG GATTTGTTGG CTACCTTGCC CCCACCATGC 

GC - 

35 #GAAGGACA 213 60 

- - GGCCTACCCT GCTAACTTCC CCTATCCGCT TATAGGCAAG ACCGCAGTTG 

AC - 

#AGCATTAC 21420 

- - CCAGAAAAAG TTTCTTTGCG ATCGCACCCT TTGGCGCATC CCATTCTCCA 
.. 40 • GT - . 

• #AACTTTAT 21480 

- - GTCCATGGGC GCACTCACAG ACCTGGGCCA A&ACCTTCTC TACGCCAACT 

CC - 

#GCCCACGC 21540 

45 -* - GCTAGACATG ACTTTTGAGG TGGATC CC AT GGAC GAGCCC AC CCTTCTTT 

AT 

#GTTTTGTT 2160 0 

- - TGAAGTCTTT GACGTGGTCC GTGTGCACCG -GCCGCACCGC GGCGTCATCG 

AA - 

50 #ACCGTGTA 21660 

- - CCT'GCGCACG CCCTTCTCGG CCGGCAACGC CACAACATAA AGAAGCAAGC 

AA - 

#CATCAACA 2172 0 

- - ACAGCTGCCG CCATGGGCTC CAGTGAGCAG GAACTGAAAG CCATTGTCAA 

55 AG - 

SATCTTGGT 217 80 

- - TGTGGGCCAT ATTTTTTGGG CACCTATGAC AAGCGCTTTC CAGGCTTTGT 

TT - 

#CTCCACAC 21840 

60 - - AAGCTCGCCT GCGCCATAGT CAATACGGCC GGTCGCGAGA CTGGGGGCGT 

AC - 

#ACTGGATG 219 00 
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- - GCCTTTGCCT GGAACCCGCA CTCAAAAACA TGCTACCTCT TTGAGCCCTT TG - 
#GCTTTTCT 219 60 

- - GACCAGCGAC TCAAGCAGGT TTACCAGTTT GAGTACGAGT CACTCCTGCG - 
5 CC - , " 

#GTAGCGCC 22020 * • 

- - ATTGCTTCTT CCCCCGACCG CTGTATAACG CTGGAAAAGT CCACCCAAAG 

CG - 

#TACAGGGG 22 080 

10 - - CCCAACTCGG CCGCCTGTGG ACTATTCTGC TGCATGTTTC TCCACGCCTT 

TG - 

#CCAACTGG 22140 

- - CCCCAAACTC CCATGGATCA CAACCCCACC ATGAACCTTA TTACCGGGGT 

AC - 

15 #CCAACTCC 22200 

- - ATGCTCAACA GTCCCCAGGT ACAGCCCACC CTGCGTCGCA ACCAGGAACA 

GC - 

#TCTACAGC 22260 

- - TTCGTGGAGC GCCACTCGCC CTACTTCCGC AGCCACAGTG CGCAGATTAG" 

■20 GA - 

#GCGCCACT 22320 

_ _ TCTTTTTGTC ACTTGAAAAA CATGTAAAAA " TAATGTACTA GAGACACTTT 

" CA - 

#ATAAAGGC 223 80 

25 - - AAATGCTTTT ATTTGTACAC TCTCGGGTGA TTATTTACCC CCACCCTTGC" 

CG - 

#TCTGCGCC 22440 

- - GTTTAAAAAT CAAAGGGGTT CTGCCGCGCA TCGCTATGCG CCACTGGCAG 

GG - 

30 #ACACGTTG 22500 

- - CGATACTGGT GTTTAGTGCT CCACTTAAAC TCAGGCACAA CCATCCGCGG 

. C A - 

#GGTCGGTG 22560 

- AAGTTTTCAC ' TCCACAGGCT GCGCACGATC ACCAACGCGT TTAGCAGGTC 

35 GG - 

#GCGCCGAT 22620 

- - ATCTTGAAGT CGCAGTTGGG GCCTCCGCCC TGCGCGCGCG AGTTGCGATA 

CA - 

#CAGGGTTG 22680 

...40 - - CAGCACTGGA ACACTATCAG CGCCGGGTGG TGCACGCTGG CCAGCACGCT 

- > CT - 

"'' #TGTCGGAG 22740 

- - ATCAGATCCG CGTCCAGGTC CTCCGCGTTG CTCAGGGCGA ACGGAGTCAA 

CT - 

45 #TTGGTAGC 228 00 

- TGCCTTCCCA AAAAGGGCGC GTGCCCAGGC TTTGAGTTGC ACTCGCACCG 

TA - 

#GTGGCATC 2 2 860 

- - AAAAGGTGAC CGTGCCCGGT CTGGGCGTTA GGATACAGCG CCTGCATAAA 

50 AG - 

#CCTTGATC 2292 0 

- - TGCTTAAAAG CCACCTGAGC CTTTGCGCCT TCAGAGAAGA ACATGCCGCA 

AG - 

#ACTTGCCG 229 80 

55 - - G AAAAC TG AT TGGCCGGACA GGCCGCGTCG TGCACGCAGC ACCTTGCGTC 

GG - 

#TGTTGGAG 23 040 

- - ATCTGCACCA CATTTCGGCC CCACCGGTTC TTCACGATCT TGGC CTTGCT 

AG - 

60 #ACTGCTCC 23100 

- - TTCAGCGCGC GCTGCCCGTT TTCGCTCGTC ACATCCATTT CAATCACGTG 

CT - 
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tCCTTATTT 23160 

- - ATCATAATGC TTCCGTGTAG ACACTTAAGC TCGCCTTCGA TCTCAGCGCA 

. GC - 

5 #GGTGCAGC 23220 

- - CACAACGCGC AGCCCGTGGG CTCGTGATGC TTGTAGGTCA CCTCTGCAAA 

CG - 

#ACTGCAGG 23 280 

- - TACGCCTGCA GGAATCGCCC CATCATCGTC ACAAAGGTCT TGTTGCTGGT 

10 GA - 

^AG'GTCAGC 23340 

- - TGCAACCCGC GGTGCTCCTC GTTCAGCCAG GTCTTGCATA CGGCCGCCAG 

AG -■ ' 

#CTTCCACT 23400 

15 - - TGGTCAGGCA GTAGTTTGAA GTTCGCCTTT AGATCGTTAT CCACGTGGTA 

CT - 

#TGTCCATC 23460 

- - AGCGCGCGCG - CAGCCTCCAT GCCCTTCTCC CACGCAGACA CGATCGGCAC 

AC - 

20 #TCAGCGGG 23 520 

- - TTCATCACCG TAATTTCACT TTCCGCTTCG CTGGGCTCTT CCTCTTCCTC 
TT - • 

#GCGTCCGC 23 5 80 

- - ATACCACGCG CCACTGGGTC GTCTTCATTC AGCCGCCGCA CTGTGCGCTT 
25 • AC - • 

#CTCCTTTG '23 640 

- - CCATGCTTGA TTAGCACCGG TGGGTTGCTG AAACCCACCA TTTGTAGCGC 

CA - ' 

#CATCTTCT 23700 

30 - - CTTTCTTCCT CGCTGTCCAC GATTAC CTCT GGTGATGGCG GGCGCTCGGG 

CT - 

#TGGGAGAA 237 60 

- - GGGCGCTTCT TTTTCTTCTT GGGCGCAATG GCCAAATCCG CCGCCGAGGT 

CG - 

35 #ATGGCCGC 2382 0 

- - GGGCTGGGTG TGCGCGGCAC CAGCGCGTCT TGTGATGAGT CTTCCTCGTC 

CT? - 

#CGGACTCG 23880 

- - ATACGCCGCC TCATCCGCTT TTTTGGGGGC GCCCGGGGAG GCGGCGGCGA 

40 CG - 

#GGGACGGG 23940 

- - GACGACACGT CCTCCATGGT TGGGGGACGT CGCGCCGCAC CGCGTCCGCG 
CT - ' 

#CGGGGGTG 24000 

45 - - GTTTCGCGCT GCTCCTCTTC CCGACTGGCC ATTTCCTTCT CCTATAGGCA 

GA - 

#AAAAGATC 24060 

- - ATGGAGTCAG TCGAGAAGAA GGACAGCCTA ACCGCCCCCT CTGAGTTCGC 

CA - 

50 #CCACCGCC 2412 0 

- - TCCACCGATG CCGCCAACGC GCCTACCACC TTCCCCGTCG AGGCACCCCC 

GC - 

#TTGAGGAG 2 418 0 

- - GAGGAAGTGA TTATCGAGCA GGACCCAGGT TTTGTAAGCG AAGACGACGA 
55 GG - ' 

#ACCGCTCA 2 4240 

- - GTACCAACAG AGGATAAAAA GCAAGACCAG GACAACGCAG AGGCAAACGA 

GG - 

#AACAAGTC 24300 

60 - - GGGCGGGGGG ACGAAAGGCA TGGCGACTAC CTAGATGTGG GAGACGACGT 

GC - 

#TGTTGAAG 24360 
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- CATCTGCAGC GCCAGTGCGC CATTATCTGC GACGCGTTGC AAGAGCGCAG CG - 
#ATGTGCCC 2 4420 

- - CTCGCCATAG CGGATGTCAG CCTTGCCTAC GAACGCCACC TATTCTCACC 

5 GC - 

#GCGTACCC 24480 

- - CCCAAACGCC AAGAAAACGG CACATGCGAG CCCAACCCGC GCCTCAACTT 

CT - 

#ACCCCGTA 24540 

10 - - TTTGCCGTGC CAGAGGTGCT TGCCACCTAT CACATCTTTT TCCAAAACTG 

CA - 

#AGATACCC 24600 

- - CTATCCTGCC GTGCCAACCG CAGCCGAGCG GACAAGCAGC TGGCCTTGCG 

GC - 

15 #AGGGCGCT 24660 

- - GTCATACCTG ATATCGCCTC GCTCAACGAA GTGCCAAAAA TCTTTGAGGG 

TC - 

#TTGGACGC 24720 

- - GACGAGAAGC GCGCGGCAAA CGCTCTGCAA CAGGAAAACA GCGAAAATGA 

,10 AA - 

#GTCACTCT 24780 

- - GGAGTGTTGG TGGAACTCGA GGGTGACAAC GCGCGCCTAG CCGTACTAAA 

AC - 

#GCAGCATC 24840 

25 - - GAGGTCACCC ACTTTGCCTA CCCGGCACTT AACCTACCCC CCAAGGTCAT 

GA - 

#GCACAGTC 24900 

- - ATGAGTGAGC TGATCGTGCG CCGTGCGCAG CCCCTGGAGA GGGATGCAAA 

TT - 

30 #TGCAAGAA . 24960 

- - CAAACAGAGG AGGGCCTACC CGCAGTTGGC GACGAGCAGC TAGCGCGCTG 

GC - 

#TTCAAACG 25 020 

- - CGCGAGCCTG CCGACTTGGA GGAGC'GACGC AAACTAATGA TGGCCGCAGT 

35 GC - 

#TCGTTACC 25 080 

- - ' GTGGAGCTTG AGTGCATGCA GCGGTTCTTT GCTGACCCGG AGATGCAGCG 

CA - 

#AGCTAGAG 25140 

- 40 - - GAAACATTGC ACTACACCTT TCGACAGGGC TACGTACGCC AGGCCTGCAA 

.1 GA - 

#TCTCCAAC 252 00 . 

-* - GTGGAGCTCT GCAACCTGGT CTCCTACCTT GGAATTTTGC ACGAAAACCG 

CC - 

45 #TTGGGCAA 25260 

- - AACGTGCTTC ATTCCACGCT CAAGGGCGAG GCGCGCCGCG ACTACGTCCG 

CG - 

#ACTGCGTT 2532 0 

- - TACTTATTTC TATGCTACAC CTGGCAGACG GCCATGGGCG TTTGGCAGCA 

50 GT - 

#GCTTGGAG 253 8 0 

- - GAGTGCAACC TCAAGGAGCT GCAGAAACTG CTAAAGCAAA ACTTGAAGGA 

CC - 

STATGGACG 25 440 

55 - - GCCTTCAACG AGCGCTCCGT GGCCGCGCAC CTGGCGGACA TCATTTTCCC 

CG - 

wAACGCCTG 25 500 

- - CTTAAAACCC TGCAACAGGG TCTGCCAGAC TTCACCAGTC AAAGCATGTT 

GC - 

60 #AGAACTTT 255 60- 

- - AGGAACTTTA TCCTAGAGCG CTCAGGAATC TTGCCCGCCA' CCTGCTGTGC 

AC - 
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#TTCCTAGC 2 5 620 

- - GACTTTGTGC CCATTAAGTA CCGCGAATGC CCTCCGCCGC TTTGGGGCCA 

CT - 

5 #GCTACCTT 25 68 0 

- - CTGCAGCTAG CCAACTACCT TGCCTACCAC TCTGACATAA TGGAAGACGT ' 

GA - 

#GCGGTGAC 25740 

- - GGTCTACTGG AGTGTCACTG TCGCTGCAAC CTATGCACCC CGCACCGCTC 

10 CC - 

#TGGTTTGC 25800 

■ * - - AATTCGCAGC TGCTTAACGA AAGTCAAATT ATCGGTACCT TTGAGCTGCA 

GG - 

#GTCCCTCG 25860 

15 • . - CCTGACGAAA AGTCGGCGGC • TCCGGGGTTG AAACTCACTC CGGGGCTGTG 

GA - 

#CGTCGGCT 2592 0 

- - TACCTTCGCA AATTTGTACC TGAGGACTAC CACGCCCACG AGATTAGGTT 

CT - 

30 4ACGAAGAC 25980 

- - CAATCCCGCC CGCCAAATGC GGAGCTTACC GCCTGCGTCA TTACCCAGGG 

CC - 

#ACATTCTT 2 6040 

- - GGCCAATTGC AAGCCATCAA CAAAGCCCGC CAAGAGTTTC TGCTACGAAA 

25 GG - 

#GACGGGGG 26100 

- - GTTTACTTGG. ACCCCCAGTC CGGCGAGGAG CTCAACCCAA TCCCCCCGCC 

GC - 

#CGCAGCCC 2 6160 

30 - - TATCAGCAGC AGCCGCGGGC CCTTGCTTCC CAGGATGGCA CCCAAAAAGA 

AG - 

#CTGCAGCT 26220 

- - GCCGCCGCCA CCCACGGACG AGGAGGAATA CTGGGACAGT CAGGCAGAGG 

AG - 

35 #GTTTTGGA 2 6280 

- - CGAGGAGGAG GAGGACATGA TGGAAGACTG GGAGAGCCTA GACGAGGAAG 

CT - 

#TCCGAGGT 2 6340 

- - CGAAGAGGTG TCAGACGAAA CACCGTCACC CTCGGTCGCA TTCCCCTCGC 
.40 CG - . 

) #GCGCCCCA 26400 

- - GAAATCGGCA ACCGGTTCCA GCATGGCTAC AACCTCCGCT CCTCAGGCGC 

CG - 

#CCGGCACT 2 6460 

45 - - GCCCGTTCGC CGACCCAACC GTAGATGGGA CACCACTGGA ACCAGGGCCG 

GT - 

#AAGTCCAA 2 6520 

- - GCAGCCGCCG CCGTTAGCCC AAGAGCAACA ACAGCGCCAA GGCTACCGCT 

CA - 

50 #TGGCGCGG 2 6580 

- - GCACAAGAAC GCCATAGTTG CTTGCTTGCA AGACTGTGGG GGCAACATCT 

CC - 

#TTCGCCCG 2 6640 

- - CCGCTTTCTT CTCTACCATC ACGGCGTGGC CTTCCCCCGT AACATCCTGC 
55 AT - * 

#TACTACCG 2 6700 

- - TCATCTCTAC AGCCCATACT GCACCGGCGG CAGCGGCAGC GGCAGCAACA 

GC - 

#AGCGGCCA 2 6760 

60 - - CACAGAAGCA AAGGCGACCG GATAGCAAGA CTCTGACAAA GCCCAAGAAA 

TC - 

#CACAGCGG 2 6820 
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- - CGGCAGCAGC AGGAGGAGGA GCGCTGCGTC TGGCGCCCAA CGAACCCGTA TC - 
#GACCCGCG 2 6880 

- - AGCTTAGAAA CAGGATTTTT CCCACTCTGT ATGCTATATT TCAACAGAGC 
5 AG - . 

^GGGCCAAG 26940 

- - AACAAGAGCT GAAAATAAAA AACAGGTCTC TGCGATCCCT CACCCGCAGC 
TG - • 

#CCTGTATC 27000 

10 - - ACAAAAGCGA AGATCAGCTT CGGCGCACGC TGGAAGACGC GGAGGCTCTC 

TT - 

#CAGTAAAT 27060 

■ • - - ACTGCGCGCT GACTCTTAAG GACTAGTTTC GCGCCCTTTC TCAAATTTAA 
GC - 

15 #GCGAAAAC 2712 0 

- - TACGTCATCT CCAGCGGCCA CACCCGGCGC CAGCACCTGT .CGTCAGCGCC 

AT - 

#TATGAGCA 2718 0 

- - AGGAAATTCC CACGCCCTAC ATGTGGAGTT ACCAGCCACA AATGGGACTT 

20 GC - 

#GGCTGGAG 27240 

- - CTGCCCAAGA CTACTCAACC CGAATAAACT ACATGAGCGC GGGACCCCAC 

AT - 

#GATATCCC 27300 

25 ' - - GGGTCAACGG AATCCGCGCC CACCGAAACC GAATTCTCTT GGAACAGGCG 

GC - 

#TATTACCA 273 60 . 

- - CCACACCTCG TAATAACCTT AATCCCCGTA GTTGGCCCGC TGCCCTGGTG 

TA - 

30 #CCAGGAAA 27420 

- - GTCCCGCTCC CACCACTGTG GTACTTCCCA GAGACGCCCA GGCCGAAGTT 

CA - 

#GATGACTA 27480- 

- - ACTCAGGGGC GCAGCTTGCG GGCGGCTTTC GTCACAGGGT GCGGTCGCCC 

35 GG - 

#GCAGGGTA 27540 

- - TAACTCACCT GACAATCAGA GGGC GAGGTA TTCAGCTCAA CGACGAGTCG 
GT - * 

#GAGCTCCT 27 600 

.40 - - CGCTTGGTCT CCGTCCGGAC GGGACATTTC AGATCGGCGG CGCCGGCCGT 

•1 CC - 

#TTCATTCA 27 660 

- - CGCCTCGTCA GGCAATCCTA ACTCTGCAGA CCTCGTCCTC TGAGCCGCGC 

TC - 

45 #TGGAGGCA 2772 0 

- - TTGGAACTCT GCAATTTATT GAGGAGTTTG TGCCATCGGT CTACTTTAAC 

CC - 

#CTTCTCGG 2 778 0 - 

- - GACCTCCCGG CCACTATCCG GATCAATTTA TTCCTAACTT TGACGCGGTA 

50 AA - 

ttGGACTCGG 27 840 

• - - CGGACGGCTA CGACTGAATG TTAAGTGGAG AGGCAGAGCA ACTGCGCCTG" 

AA - 

#ACACCTGG 27900 

55 

- - TCCACTGTCG CCGCCACAAG TGCTTTGCCC GCGACTCCGG TGAGTTTTGC 

TA - 

#CTTTGAAT 27 9 60 

- - TGCCCGAGGA TCATATCGAG GGCCCGGCGC ACGGCGTCCG GCTTACCGCC 

60 CA - 

#GGGAGAGC 2 8 020 
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- - TTGCCCGTAG CCTGATTCGG GAGTTTACCC AGCGCCCCCT GCTAGTTGAG CG - 
#GGACAGGG 2 8 080 

- - GACCCTGTGT TCTCACTGTG ATTTGCAACT GTCCTAACCT TGGATTACAT 

5 CA - 

#AGATCTTT 2 8140 

- - GTTGCCATCT CTGTGCTGAG TATAATAAAT ACAGAAATTA AAATATACTG 

GG - 

#GCTCCTAT 2 8200 

10 . - - CGCCATCCTG TAAACGCCAC CGTCTTCACC CGCCCAAGCA AACCAAGGCG 

AA - 

#CCTTACCT 2 82 60 

- - GGTACTTTTA ACATCTCTCC CTCTGTGATT TACAACAGTT TCAACCCAGA 

CG - 

15 #GAGTGAGT 2 8320 

- - CTACGAGAGA ACCTCTCCGA GCTCAGCTAC TCCATCAGAA AAAACACCAC 

CC - 

#TCCTTACC 28380 

- - TGCCGGGAAC GTACGAGTGC GTCACCGGCC GCTGCACCAC ACCTACCGCC 
20 TG - . , 

#ACCGTAAA 28440 

- - CCAGACTTTT TCCGGACAGA CCTCAATAAC TCTGTTTACC AGAACAGGAG 

GT - 

#GAGCTTAG 2 8500 

25 - - AAAACCCTTA GGGTATTAGG CCAAAGGCGC AGCTACTGTG GGGTTTATGA 

AC - 

#AATTCAAG 28560 

- CAACTCTACG GGCTATTCTA ATTCAGGTTT CTCTAGAATC GGGGTTGGGG 

TT - 

30 #ATTCTCTG 2 8620 

- - TCTTGTGATT CTCTTTATTC TT AT AC T AAC GCTTCTCTGC CTAAGGCTCG 

CC - 

#GCCTGCTG 2 8680 

- - TGTGCACATT TGCATTTATT GTCAGCTTTT TAAACGCTGG GGTCGCCACC 

35 CA - 

#AGATGATT 2 8740 

- - AGGTACATAA TCCTAGGTTT ACTCACCCTT GCGTCAGCCC ACGGT AC C AC 

CC - 

#AAAAGGTG 2880-0 . 
• 40 - - GATTTtAAGG AGCCAGCCTG TAATGTTACA TTCGCAGCTG AAGCTAATGA 

J GT - 

#GCACCACT 2 8860 

- - CTTATAAAAT GCACCACAGA ACATGAAAAG CTGCTTATTC GCCACAAAAA 

CA - 

45 #AAATTGGC 2 8920 

- AAGTATGCTG TTTATGCTAT TTGGCAGCCA GGTGACACTA CAGAGTATAA 

TG - 

# TT AC AGTT 2 8 9 8 0 

- - TTCCAGGGTA AAAGTCATAA AACTTTTATG TATACTTTTC CATTTTATGA 

50 AA' - 

#TGTGCGAC 29040 

- - ATTACCATGT ACATGAGCAA ACAGTATAAG TTGTGGC CCC CACAAAATTG 

TG - 

# TGG AAAAC 29100 * • 

55 - - ACTGGCACTT TCTGCTGCAC TGCTATGCTA ATTACAGTGC TCGCTTTGGT 

CT - 

#GTACCCTA. 2 9160 

- - CTCTATATTA AATACAAAAG - C AGACGCAGC TTTATTGAGG AAAAGAAAAT 

GC - 

60 #CTTAATTT 29220 

- - ACTAAGTTAC AAAGCTAATG TCACCACTAA CTGCTTTACT CGCTGCTTGC 

AA - 
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#AACAAATT 292 80 

- - CAAAAAGTTA GCATTATAAT TAGAATAGGA TTTAAACCGC CCGGTCATTT 

CC - 

5 #TGCTCAAT 29340 

- - ACCATTCCCC TGAACAATTG ACTCTATGTG GGATATGCTC CAGCGCTACA 

AC - 

#CTTGAAGT . 29400 

- - CAGGCTTCCT GGATGTCAGC ATCTGACTTT GGCCAGCACC TGTCCCGCGG 
10 AT - • • 

#TTGTTCCA 29460 

- GTCCAACTAC AGCGACCCAC CCTAACAGAG ATGACCAACA CAACCAACGC 

GG - 

#CCGCCGCT 29520 ■ 

15 - - ACCGGACTTA CATCTACCAC AAATACACCC CAAGTTTCTG CCTTTGTCAA 

TA - 

#ACTGGGAT 29580 

- - AACTTGGGCA TGTGGTGGTT CTCCATAGCG CTTATGTTTG TATGCCTTAT 

TA - 

? f 0 #TTATGTGG 29 640 

- - CTCATCTGCT GCCTAAAGCG CAAACGCGCC CGACCACCCA TCTATAGTCC 

CA - 

. #TCATTGTG 297 00 

- - CTACACCCAA ACAATGATGG AATCCATAGA TTGGACGGAC TGAAACACAT 

25 GT - 

#TCTTTTCT 29760 

- - CTTACAGTAT GATTAAATGA GACATGATTC CTCGAGTTTT TATATTACTG 

AC - 

#CCTTGTTG 29 820 

30 - - CGCTTTTTTG TGCGTGCTCC ACATTGGCTG CGGTTTCTCA CATCGAAGTA 

GA - 

#CTGCATTC 29 880 

- - CAGCCTTCAC AGTCTATTTG CTTTACGGAT TTGTCACCCT CACGCTCATC 
TG • 

35 #CAGCCTCA 29940 

- - TCACTGTGGT CATCGCCTTT ATCCAGTGCA TTGACTGGGT CTGTGTGCGC 

- TT - 

#TGCATATC 30000 

- - TCAGACACCA TCCCCAGTAC AGGGACAGGA CTATAGCTGA GCTTCTTAGA 

...40 AT - 
> 4TCTTTAAT 3 0060 

- - TATGAAATTT ACTGTGACTT TTCTGCTGAT TATTTGCACC CTATCTGCGT 

TT - - 

#TGTTCCCC 30120 

45 - - GACCTCCAAG CCTCAAAGAC ATATATCATG CAGATTCACT CGTATATGGA 

AT - ' 

#ATTCCAAG 3 0180 

- - TTGCTACAAT GAAAAAAGCG ATCTTTCCGA " AGC CTGGTT A TATGCAATCA 

TC - 

50 #TCTGTTAT 3 0240 

- - GGTGTTCTGC AGTACCATCT TAGCCCTAGC T AT AT ATC CC T AC CTTG AC A 

TT - 

#GGCTGGAA 3 03 00 

- - AC G AAT AG AT GCCATGAACC ACCCAACTTT CCCCGCGCCC GCTA TGCTTC 

55 CA - 

#CTGCAACA 3 0360 

- - AGTTGTTGCC GGCGGCTTTG TCCCAGCCAA TCAGCCTCGC CCCACTTCTC 

CC - ■ 

#ACCCCCAC 3042 0 

60 - - TGAAATCAGC TACTTTAATC TAACAGGAGG AGATGACTGA CACCCTAGAT 

CT - 

SAGAAATGG 3 0480 
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- - ACGGAATTAT TACAGAGCAG CGCCTGCTAG AAAGACGCAG GGCAGCGGCC GA - 
#GCAAC AGC 3 0540 

- - GCATGAATCA AGAGCTCCAA GACATGGTTA ACTTGCACCA GTGGAAAAGG 

GG - 

#TATCTTTT 3 0600 

- - GTCTGGTAAA GCAGGCCAAA GTCACCTACG ACAGTAATAC CACCGGACAC 

CG - 

#CCTTAGCT 3 0660 

- - ACAAGTTGCC AACCAAGCGT CAGAAATTGG TGGTCATGGT GGGAGAAAAG 

CC - 

#CATTACCA 3 0720 

- - TAACTCAGCA CTCGGTAGAA ACCGAAGGCT GCATTCACTC ACCTTGTCAA 

GG - 

#ACCTGAGG 3 0780 

- ATCTCTGCAC CCTTATTAAG ACCCTGTGCG GTCTCAAAGA TCTTATTCCC 

TT - 

# TAACTAAT 3 0840 

■ ■ - - AAAAAAAAAT. AATAAAGCAT CACTTACTTA AAATCAGTTA GCAAATTTCT 

GT - 

#CCAGTTTA 30900 

- - TTCAGCAGCA CCTCCTTGCC ' CTCCTCCCAG CTCTGGTATT GCAGCTTCCT 

CC - 

• #TGGCTGCA 30960 

- - AACTTTCTCC ACAATCTAAA TGGAATGTCA GTTTCCTCCT GTTCCTGTCC 

AT - 

#CCGCACCC 3102 0 

- - ACTATCTTCA TGTTGTTGCA GATGAAGCGC GCAAGACCGT CTGAAGATAC 

CT - 

#TCAACCGC 3108 0 

- - GTGT ATC CAT ATGACACGGA AACCGGTCCT CCAkCTGTGC CTTTTCTTAC 

TC - 

#CTCCCTTT 31140 * 

- - GTATCCCCCA ATGGGTTTCA AGAGAGTCCC CCTGGGGTAC TCTCTTTGCG 

CC - 

#TATCCGAA 3120 0 

- - CCTCTAGTTA CCTCCAATGG CATGCTTGCG CTCAAAATGG GCAACGGCCT 

CT - 

#CTCTGGAC 312 60 

- - GAGGCCGGCA ACCTTACCTC CCAAAATGTA ACCACTGTGA GCCCACCTCT 

CA - 

#AAAAAACC 3132 0 

- - AAGTCAAACA TAAACCTGGA AATATCTGCA CCCCTCACAG TTACCTCAGA 

AG - 

#CCCTAACT 313 80 

- - GTGGCTGCCG CCGCACCTCT AATGGTCGCG GGCAACACAC TCACCATGCA 

AT - 

#CACAGGCC 31440 

- - CCGCTAACCG TGCACGACTC CAAACTTAGC ATTGCCACCC AAGGACCCCT 

CA - 

#CAGTGTCA 31500 

- - GAAGGAAAGC TAGCCCTGCA AACATCAGGC CCCCTCACCA CCACCGATAG 

CA - 

#GTACCCTT 31560 

- - ACTATCACTG CCTCACCCCC TCTAACTACT GCCACTGGTA GC TTGGGC AT 

TG - 

#ACTTGAAA 31620 

- - GAGCCCATTT ATACACAAAA TGGAAAACTA GGACTAAAGT ACGGGGCTCC 

TT - 

#TGCATGTA 31680 

- - ACAGACGACC TAAACACTTT GACCGTAGCA ACTGGTCCAG GTGTGACTAT 

TA - 
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#ATAATACT 31740 

- - TCCTTGCAAA CTAAAGTTAC TGGAGCCTTG GGTTTTGATT CACAAGGCAA 

TA' - 

5 #TGCAACTT 31800 

- - AATGTAGCAG GAGGACTAAG GATTGATTCT CAAAACAGAC GCCTTATACT 

TG - 

#ATGTTAGT 318 60 

- - TATCCGTTTG ATGCTCAAAA CCAACTAAAT CTAAGACTAG GACAGGGCCC 

10 TC*- 

#TTTTTATA 3192 0 

- - AACTCAGCCC ACAACTTGGA TATTAACTAC AACAAAGGCC TTTACTTGTT 

TA - 

#CAGCTTCA 3198 0 

15 - - AACAATTCCA AAAAGCTTGA GGTTAACCTA AGCACTGCCA AGGGGTTGAT 

GT - ' 

#TTGACGCT 3 2 040 
' - - ACAGCCATAG CCATTAATGC AGGAGATGGG CTTGAATTTG GTTCACCTAA 

TG - * 
-*pq #CACCAAAC 32100 

- . - ACAAATCCCC TCAAAACAAA AATTGGC CAT GGCCTAGAAT TTGATTCAAA 

CA - 

#AGGCTATG 32160 
• - - GTTCCTAAAC TAGGAACTGG CCTTAGTTTT GACAGCACAG GTGCCATTAC 

25 AG - 

# TAGGAAAC 3222 0 

- - AAAAATAATG ATAAGCTAAC TTTGTGGACC ACACCAGCTC CATCTCCTAA 

CT - 

#GTAGACTA 32280 

30 - - AATGC AGAGA AAGATGCTAA ACTCACTTTG GTCTTAACAA AATGTGGCAG 

TC - 

#AAATACTT 32340 

- - GCTACAGTTT CAGTTTTGGG TGTTAAAGGC AGTTTGGCTC CAATATCTGG 

AA - 

35 #CAGTTCAA 3 2400 

- - AGTGCTCATC TTATTATAAG ATTTGAC GAA AATGGAGTGC TACTAAACAA 

TT - 

#CCTTCCTG 324 60 

- - GACCCAGAAT ATTGGAACTT TAGAAATGGA GATCTTACTG AAGGCACAGC 

^40 CT - 
) # ATACAAAC 3 252 0 

- - GCTGTTGGAT TTATGCCTAA CCTATCAGCT TATCCAAAAT CTCACGGTAA 

AA - 

#CTGCCAAA 32580 

45 - - AGTAACATTG TCAGTCAAGT TTACTTAAAC GGAGACAAAA CTAAACCTGT 

AA - 

#CACTAACC 32 640 

- ATTACACTAA ACGGTACACA GGAAACAGGA GACACAACTC CAAGTGCATA 

CT - 

50 #CTATGTCA 32700 

- - TTTTCATGGG ACTGGTCTGG CCACAACTAC ATTAATGAAA TATTTGCCAC 

AT - 

#CCTCTTAC 327 60 

- - AC TTTTTC AT ACATTGCCCA AGAATAAAGA ATCGTTTGTG TTATGTTTCA 

55 AC - 

#GTGTTTA.T 32 820 

- - TTTTCAATTG CAGAAAATTT CAAGTCATTT TTCATTCAGT AGTATAGCCC 

CA - 

#CCACCACA 32 880 

60 - - TAGCTTATAC AG ATC AC CGT ACCTTAATCA AACTCACAGA ACCCTAGTAT 

TC - 

ftAACCTGCC 32 940 
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FIGURE 6 (28/30) 

- - ACCTCCCTCC CAACACACAG AGTACACAGT CCTTTCTCCC CGGCTGGCCT TA - 
#AAAAGCAT 33000 

- - CATATCATGG GTAACAGACA TATTCTTAGG TGTTATATTC CACACGGTTT 

5 CC - 

#TGTCGAGC 3 3060 

- - CAAACGCTCA TCAGTGATAT TAATAAACTC CCCGGGCAGC TCACTTAAGT 

TC - 

#ATGTCGCT 3312 0 

.0 - - GTCCAGCTGC TGAGCCACAG GCTGCTGTCC AACTTGCGGT TGCTTAACGG 

' GC - 

#GGCGAAGG 33180 . 

- - AGAAGTCCAC GCCTACATGG GGGTAGAGTC ATAATCGTGC ATCAGGATAG 

GG - 

[5 - #CGGTGGTG 33240 

— - CTGCAGCAGC GCGCGAATAA ACTGCTGCCG CCGCCGCTCC GTCCTGCAGG 

AA - 

#TACAACAT 333 00 

- - GGCAGTGGTC TCCTCAGCGA TGATTCGCAC CGCCCGCAGC ATAAGGCGCC 

20 TT - 

#GTCCTCCG 33360 

- - GGCACAGCAG CGCACCCTGA TCTCACTTAA ATCAGCACAG TAACTGCAGC 

' AC - 

#AGCACCAC 3 3420 . 

25 - - AATATTGTTC AAAATCCCAC AGTGCAAGGC GCTGTATCCA AAGCTCATGG 

CG - 

#GGGACCAC 33480 

- - AGAACCCACG TGGCCATCAT ACCACAAGCG CAGGTAGATT AAGTGGCGAC 

CC* - 

30 # CTCATAAA 33540 

- - CACGCTGGAC ATA&ACATTA CCTCTTTTGG CATGTTGTAA TTCACCACCT 

CC - 

#CGGTACCA 33600 

. - - TATAAACCTC TGATTAAACA TGGCGC CATC CACCACCATC CTAAACCAGC 

35 . TG - 

#GCCAAAAC 33660 

- - CTGCCCGCCG GCTATACACT GCAGGGAACC GGGACTGGAA CAATGACAGT 

GG - 

#AGAGCCCA 33720 

40 - - GGACTCGTAA CCATGGATCA • TCATGCTCGT CATGATATCA ATGTTGGCAC 

) AA - 

#CACAGGCA 33780 

- - CACGTGCATA CACTTCCTCA GGATTACAAG CTCCTCCCGC GTTAGAACCA 

TA - 

45 #TCCCAGGG 33 840 

- - AACAACCCAT TCCTGAATCA GCGTAAATCC CACACTGCAG GGAAGACCTC 

GC - 

#ACGTAACT 33900 

- - CACGTTGTGC ATTGTCAAAG TGTTACATTC GGGCAGCAGC GGATGATCCT 

50 CC - 

#AGTATGGT 33960 

- - AGCGCGGGTT TCTGTCTCAA AAGGAGGTAG- ACGATCCCTA CTGTACGGAG 

TG - 

#CGCCGAGA 34020 

55 - - CAACCGAGAT CGTGTTGGTC GTAGTGTCAT GCCAAATGGA ACGCCGGACG 

TA - 

# GTC ATATT 340 80 ' 

- - TCCTGAAGCA AAACCAGGTG CGGGCGTGAC AAACAGATCT GCGTCTCCGG 

TC - 

60 #TCGCCGCT. 3 414 0 

- - TAGATCGCTC TGTGTAGTAG TTGTAGTATA TCCACTCTCT CAAAGCATCC 

AG - 
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FIGURE 6 (29/30) 

#GCGCCCCC 3 4200 

- - TGGCTTCGGG TTCTATGTAA ACTCCTTCAT GCGCCGCTGC CCTGATAACA 

TC - 

5 #CACCACCG 3 4260 

- - CAGAATAAGC CACACCCAGC CAACCTACAC ATTCGTTCTG CGAGTCACAC 
AC - . 

#GGGAGGAG 34320 

- - CGGGAAGAGC TGGAAGAACC ATGTTTTTTT TTTTATTCCA AAAGATTATC ■ 

10 CA 

#AAACCTCA 34380 

- - AAATGAAGAT CTATTAAGTG AACGCGCTCC CCTCCGGTGG CGTGGTCAAA 

CT - 

#CTACAGCC 3 4440 

15 - -.AAAGAACAGA TAATGGCATT TGTAAGATGT TGCACAATGG CTTCCAAAAG 

GC - 

#AAACGGCC 34500 

- - CTCACGTCCA AGTGGACGTA AAGGCTAAAC CCTTCAGGGT GAATCTCCTC 

TA - 

20 #TAAACATT 34560 

- - CCAGCACCTT CAACCATGCC CAAATAATTC TCATCTCGCC ACCTTCTCAA 

TA - 

#TATCTCTA 34620 

- - AGCAAATCCC GAATATTAAG TCCGGCCATT GTAAAAATCT GCTCCAGAGC 

25 GC - 

#CCTCCACC 34680 

- - TTCAGCCTCA AGCAGCGAAT CATGATTGCA AAAATTCAGG TTCCTCACAG 

AC - 

#CTGTATAA 34740 

30 - - GATTCAAAAG CGGAACATTA ACAAAAATAC CGCGATCCCG TAGGTCCCTT 

CG - 

#CAGGGCCA 348 00 

- - GCTGAACATA ATCGTGCAGG TCTGCACGGA CCAGCGCGGC CACTTCCCCG 

CC - 

35 #AGGAACCT 3 4860 

- - TGACAAAAGA ACCCACACTG ATTATGACAC GCATACTCGG AGCTATGCTA 

AC - 

#CAGCGTAG 3 492 0 

- - CCCCGATGTA AGCTTTGTTG CATGGGCGGC GATATAAAAT GCAAGGTGCT 

.40 GC - 

:T J #TCAAAAAA 34980 

- - TCAGGCAAAG CCTCGCGCAA AAAAGAAAGC ACATCGTAGT CATGCTCATG 

CA " 

#GATAAAGG 35040 

45 - - CAGGTAAGCT CCGGAACCAC CACAGAAAAA GACACCATTT TTCTCTCAAA 

CA - 

#TGTCTGCG 35100 

- - GGTTTCTGCA TAAACACAAA ATAAAATAAC AAAAAAACAT TTAAACATTA 

GA - 

50 &AGCCTGTC 35160 

- - TTACAACAGG AAAAACAACC CTTATAAGCA TAAGACGGAC TACGGCCATG 

CC - 

#GGCGTGAC 3 5 220 . 

- - CGTAAAAAAA CTGGTC AC CG TGATTAAAAA GCACCACCGA CAGCTCCTCG 

55 GT - 

irCATGTCCG 3 5280 

_ - GAGTCATAAT' GTAAGACTCG GTAAACACAT CAGGTTGA.TT CATCGGTCAG 

TG - 

#CTAAAAAG 35340 

60 - - CGACCGAAAT AGCCCGGGGG AATAC AT AC C CGCAGGCGTA GAGACAACAT 

TA - 

#CAGCCCCC 35400 
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FIGURE 6 (30/30) 

- ATAGGAGGTA TAACAAAATT AATAGGAGAG AAAAACACAT AAACACCTGA. AA - 
. #AACCCTCC 35460 

- TGCCTAGGCA AAATAGCACC CTCCCGCTCC AGAACAACAT ACAGCGCTTC 



AC - 



#AGCGGCAG . 35520 

- - CCTAkCAGTC AGCCTTACCA GTAAAAAAGA AAACCTATTA AAAAAACACC 

AC - 

10 #TCGACACG 3 5580 

- - GCACCAGCTC AATCAGTCAC AGTGTAAAAA AGGGCCAAGT GCAGAGCGAG 



TA - 



#TATATAGG 3 5640 

- - ACTAAAAAAT GACGTAACGG TTAAAGTCCA CAAAAAACAC CCAGAAAACC 

15 GC - 

#ACGCGAAC 35700 

- - CTACGCCCAG AAACGAAAGC CAAAAAACCC ACAACTTCCT CAAATCGTCA 

CT - 

#TCCGTTTT 3 5760 

•20 - - CCCACGTTAC GTAACTTCCC ATTTTAAGAA AACTACAATT CCCAACACAT 

■ AC - * 
#AAGTTACT 3 5 820 

- - CCGCCCTAAA ACCTACGTCA CCCGCCCCGT TCCCACGCCC CGCGCCACGT 

CA - 

25 #CAAACTCC 35880 

- - ACCCCCTCAT TATCATATTG GCTTCAATCC AAAATAAGGT ATATTATTGA 
TG - #ATG 

35935 



30 
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SEQUENCE LISTING 

<110> TRANS GENE S.A. 

<120> Adenovirus protein IX, its domains involved in capsid 
assembly, transcriptional activity, and nuclear 
reorganization 

<130> pIX mutants 

<140> 
<141> 

<160> 33 

<170> Patentln Ver . 2.1 

<210> 1 
<211> 140 
<212> PRT 

<213> Adenovirus type 5 
<400> 1 

Met Ser Thr Asn Ser Phe Asp Gly Ser lie Val Ser Ser Tyr Leu Thr 
15 10 15 

Thr Arg Met Pro Pro Trp Ala Gly Val Arg Gin Asn Val Met Gly Ser 
20 25 30 

Ser lie Asp Gly Arg Pro Val Leu Pro Ala Asn Ser Thr Thr Leu Thr 
35 40 45 

Tyr Glu Thr Val Ser Gly Thr Pro Leu Glu Thr Ala Ala Ser Ala Ala 
50 55 " 60 

Ala Ser Ala Ala Ala Ala Thr Ala Arg Gly lie Val Thr Asp Phe Ala 
65 70 75 80 

Phe Leu Ser Pro Leu Ala Ser Ser Ala Ala Ser Arg Ser Ser Ala Arg 
85 90 95 

Asp Asp Lys Leu Thr Ala Leu Leu Ala Gin Leu Asp Ser Leu Thr Arg 
100 105 110 

Glu Leu Asn Val Val Ser Gin Gin Leu Leu Asp Leu Arg Gin Gin Val 
115 120 125 

Ser Ala Leu Lys Ala Ser Ser Pro Pro Asn Ala Val 
130 135 140 



<210> 2 
<211> 37 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: mutagenesis 
oligonucleotide 
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<400> 2 

gacaacgcgc atgcccccag gggtgcgtca gaatgtg 



37 



<210> 3 
<211> 37 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: mutagenesis 
oligonucleotide 

<400> 3 

cacattctga cgcacccctg ggggcatgcg cgttgtc 37 



<210> 4 
<211> 46 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: mutagenesis 
oligonucleotide 



<210> 5 
<211> 46 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: mutagenesis 
oligonucleotide 

<400> 5 

ccccggccca tgggggcatg cgcgttgtgc tcacaatgct tccatc 46 



<210> 6 
<211> 39 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: mutagenesis 
oligonucleotide 



<400> 4 

gatggaagca ttgtgagcac aacgcgcatg cccccatggg ccgggg 



46 



<400> 6 

cccccatggg ccggggtgct cgacgtgatg ggctccagc 



39 



<210> 7 
<211> 38 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: mutagenesis 
oligonucleotide 

<400> 7 

cgacctcggg tagtggtcga gcaccccggc ccatggggg 

<210> 8 
<211> 46 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: mutagenesis 
oligonucleotide 

<400> 8 

gggccggggt gcgtcagaat gtgatgcata tgcccgtcct gcccgc 

<210> 9 
<211> 46 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: mutagenesis 
oligonucleotide 

<400> 9 

gcgggcagga cgggcatatg catcacattc tgacgcaccc cggccc 

<210> 10 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: mutagenesis 
oligonucleotide 

<400> 10 

ggattctttg acccgggagc ccaatgtcgt ttctcagc 

<210> 11 
<211> 37 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: mutagenesis 
oligonucleotide 

<400> 11 

gctgagaaac gacattgggc tccgggtcaa agaatcc 
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<210> 12 
<212> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: mutagenesis 
oligonucleotide 

<400> 12 

gggaacttaa tgtcgactct cagcagctgt tgg 



<210> 13 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: mutagenesis 
oligonucleotide 

<400> 13 

ccaacagctg ctgagagtcg acattaagtt ccc 

<210> 14 
<211> 47 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: mutagenesis 
oligonucleotide 

<400> 14 

ggattctttg acccgggagc ccaatgtcga ctctcagcag ctgttgg 



<210> 15 
<211> 46 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: mutagenesis 
oligonucleotide 

<400> 15 

ccaacagctg ctgagagtcg acattgggct ccgggtcaaa gaatcc 

<210> 16 

<211> 32 

<212> DNA 

<213> Artificial Sequence 



<220> 
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<223> Description of Artificial Sequence: mutagenesis 
oligonucleotide 

<400> 16 

cgccgttgga gactccatgg accgcccgcg gg 

<210> 17 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: mutagenesis 
oligonucleotide 

<400> 17 

cccgcgggcg gtccatggag tctccaacgg eg 

<210> 18 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: mutagenesis 
oligonucleotide 

<400> 18 

ggattctttg acccggaaac ttaatgtcgt ttctcagc 

<210> 19 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: mutagenesis 
oligonucleotide 

<400> 19 

gctgagaaac gacattaagt ttcegggtea aagaatcc 

<210> 20 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: mutagenesis 
oligonucleotide 

<400> 20 

gaeggctett ttggcaaagc ttgattcttt gaeceggg 



<210> 21 
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<211> 38 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: mutagenesis 
oligonucleotide 

<400> 21 

cccgggtcaa agaatcttgc tttgccaaaa gagccgtc 38 



<210> 22 

<211> 52 

<212> DNA 

<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: mutagenesis 
oligonucleotide 

<400> 22 

cgccagcagg tttctgccct gggatccaag gcttcctccc ctcccaatgc gg 52 



<210> 23 

<211> 52 

<212> DNA 

<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: mutagenesis 
oligonucleotide 

<400> 23 

ccgcattggg aggggaggaa gccttggatc ccagggcaga aacctgctgg eg 52 



<210> 24 

<211> 30 

<212> DNA 

<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: mutagenesis 
oligonucleotide 

<400> 24 

gatccaagaa gaagaagaag aagaagtaag 3 0 



<210> 25 
<211> 60 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: mutagenesis 
oligonucleotide 
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<400> 25 

gatccttact tcttcttctt cttcttcttt gatcccgccg acgcggaggc ggagccgctg 60 

<210> 26 
<211> 60 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: muatgenesisi 
oligonucleotide 

<400> 26 

gatccagcgc ctccgcctcc gcgtcggcgg gatcaaagaa gaagaagaag aagaagtaag 60 

<210> 27 
<211> 66 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: mutagenesis 
oligonucleotide 

<400> 27 

gatccttact tcttcttctt cttcttcttt gatcccgccg acgcggaggc ggagccgctg 60 
gatccg 66 

<210> 28 
<211> 68 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: mutagenesis 
oligonucleotide 

<400> 28 

gctgttggat ctgcgccagc agaagaagaa gagatctgcc ctgaaggctt cctcccctcc 60 
caatgcgg 68 

<210> 29 
<211> 68 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: mutagenesis 
oligonucleotide 



<400> 29 

ccgcattggg aggggaggaa gccttcaggg cagatctctt cttcttctgc tggcgcagat 60 
ccaacagc 68 
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<211> 80 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: mutagenesis 
oligonucleotide - 

<400> 30 

gctgttggat ctgcgccagc agaagaagaa gaagaaaaaa aaataatctg ccctgaaggc 60 
ttcctccpcct cccaatgcgg 80 
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<210> 31 

<211> 80 

<212> DNA 

<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: mutagenesis 
oligonucleotide 

<400> 31 

ccgcattggg aggggaggaa gccttcaggg cagattattt ttttttcttc ttcttcttct 60 
gctggcgcag atccaacagc 80 



<210> 32 
<211> 134 
<212> PRT 
<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: mutant pIX 
connected with a 7K binding moiety 

<40 0> 32 

Met Ser Thr Asn Ser Phe Asp Gly Ser lie Val Ser Ser Tyr Leu Thr 
15 10 15 



Thr Arg Met Pro Pro Trp Ala Gly 
20 

Ser lie Asp Gly Arg Pre Val Leu 

35 40 

Tyr Glu Thr Val Ser Gly Thr Pro 
50 55 

Ala Ser Ala Ala Ala Ala Thr Ala 
65 70 

Phe Leu Ser Pro Leu Ala Ser Ser 

85 

Asp Asp Lys Leu Thr Ala Leu Leu 
100 

Glu Pro Asn Val Asp Ser Gin Gin 
115 120 



Val Arg Gin Asn Val Met Gly Ser 
25 30 

Pro Ala Asn Ser Thr Thr Leu Thr 
45 

Leu Glu Thr Ala Ala Ser Ala Ala 
60 

Arg Gly He Val Thr Asp Phe Ala 
75 80 

Ala Ala Ser Arg Ser Ser Ala Arg 
90 95 

Ala Gin Leu Asp Ser Leu Thr Arg 
105 HO 

Leu Leu Asp Leu Arg Gin Gin Lys 
125 
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Lys Lys Lys Lys Lys Lys 
130 



<210> 33 
<211> 150 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: mutant of Ad5 
pIX connected to the 7K binding moiety through the 
use of a spacer 

<400> 33 

Met Ser Thr Asn Ser Phe Asp Gly Ser lie Val Ser Ser Tyr Leu Thr 
1 5 10 15 

Thr Arg Met Pro Pro Trp Ala Gly Val Arg Gin Asn Val Met Gly Ser 
20 25 30 

Ser lie Asp Gly Arg Pro Val Leu Pro Ala Asn Ser Thr Thr Leu Thr 
35 40 45 

Tyr Glu Thr Val Ser Gly Thr Pro Leu Glu Thr Ala Ala Ser Ala Ala 
50 55 60 

Ala Ser Ala Ala Ala Ala Thr Ala Thr Gly lie Val Thr Asp Phe Ala 
65 70 75 80 

Phe Leu Ser Pro Leu Ala Ser Ser Ala Ala Ser Arg Ser Ser Ala Arg 

85 90 95 

Asp Asp Lys Leu Thr Ala Leu Leu Ala Gin Leu Asp Ser Leu Thr Arg 
100 105 110 

Glu Pro Asn Val Asp Ser Gin Gin Leu Leu Asp Leu Arg Gin Gin Val 
115 120 " 125 

Ser Ala Leu Lys Gly Ser Ser Ala Ser Ala Ser Ala Ser Ala Ser Lys 
130 135 140 

Lys Lys Lys Lys Lys Lys 
145 150 



